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ON CERTAIN KEY PROBLEMS OF INFORMATION 



SCIENCE. 

(PREFACE) 

The collection of articles «F'robloms of Information Sci- 
encc» (FID 478) is the second publication of this kind is- 
sued by the FID Research Commi-ttee ^Theoretical Basis of 
Information». (FID/Rl). The first collection had a similar 
title — «Theoretical Problems of rnf'ormatics» (FID 435) 
and came off the press in 1968. 

The present collection consists of 12 papers written hy 
15 authors from 9 countries. The collection embraces most 
of the crucial problems facing modern information scien- 
ce and scientific information activities, ranging from in- 
formation needs of scientists and specialists to the specific 
aspects of computerixa'tion of various information proces- 
ses. 

The iirst paper «Studies of Information Needs» is by P. 
Atanasiu (Socialist Republic of Rumania). It reviews the 
, main methods of studying this important problem of mo- 
: dern information science. In recent years, however, there 
bas been a tendency towards a relative decrease in the 
: number of studies devoted to this problem. For instance, in 
' November 1961 «Current Research and Development in 
• Scientific Documentation» reported 39 projects or studies 
: of information needs and uses; in November 1958—1962, 
: in November 1958—1964, in 1961—1966, in 1943—1969, 
: while the overall number" of -research projects in informa- 
tion science and technology reported in «Current Rese- 
■ srch and Development in Scientific Documentation» during 
^ this period increased from 271 to 785, i. e. almost three- 
; fold. 



^ H can be cxphiincd by the fact that academic rcscarcb 
into informaHon needs, xlcspilc its tremendous difficiiily 
and complexity, has evidently failed to juslify the hopes 
that had been placed on it. 

On the one hand, these studies have made it possible 
to identify the types of inlorrnation needs and the catego- 
ries of inlorrnation users which had been largely known by 
the practical experience in publishing scientific and tech- 
nical literature and by the experience ^gained in informa- 
tion and library-bibliographic work. On the other hand, 
these studies have shown that each particular information 
need is a very complex and many-sided phenomenon. It 
was discovered that each information need is, in a sense, 
unique, for it depends on the user^s professional backgro- 
und, his psychological disposition, age, the task facing him 
and many other factors. 

I:t folbws from the unique character of each informa- 
tion need that, in order to raise the efficiency of informa- 
tion service rendered to scientists and speciaiists, one sho- 
uld strive, wherever possible, to provide the information 
system with a capability of feedback with the user. Such a 
feedback system opens up broad opportunities for thoro- 
ugh study of information needs of scientists and spccia- 
hsts. It confirms the conclusion that at the present stage of 
infonnalion science development the main source of cog- 
niiion of (lie regularities of communication in science is the 
information practice. 

It should also be mentioned that studies of information 
needs and of their dynamics have been gaining importance 
as an approach to exploring the processes of creative work, 
i. e. they are actually outstepping the scope of information 
science. 

The subject discussed in the paper by A. Merta (Cze- 
choslovakia) covers informal methods of scientific commu- 
nica'tion. For the past 5 to 7 years it has been the main 
subject of his research. Taking advantage of this oppor- 
tunity, 1 should like to express in this preface my own opi- 
nion on the general approach to the studies of informal and 
formal channels of scientific communication. 

To begin with, one can hardly agree to any — explicit 
or implicit — contraposition of informal channels of scien- 
tific communication to 'the formal ones or to declarations 
to the effect that informal channels are more important 



than formal ones. When speaking of the si/slem of scienti- 
fic communication we should never forget that both infor- 
mal and formal channels arc organic components of one 
and the same system. In this system they perform diffe- 
rent functions and, therefore, possess different properties. 

A most important attribute of a system is that it pos- 
sesses a certain iniegraUlij, i. e. 'that the entropy of the to- 
tality of the component making up the system is less than 
that of a simple aggregate of interconnected and inteiac- 
'ting objects and phenomena. If the definitive characteristic 
of a system were merely the presence of interconnection 
and interaction between objects and phenomena, any set 
of these could be called a system, since all objects and phe- 
nomena are to a greater or lesser extent interconnected 
and interacting with each other. In that case, it would be 
senseless to introduce the concept of «system». 

In our view, to maintain that informal channels of 
scientific communication are more important than formal 
.ones would be as unjustified as to believe that a certain 
body organ or a system of organs (say, the nervous sys- 
tem) is of a greater importance for the human organism 
than some other organ or system (say, the heart and the 
blood system, the lungs or the skeleton). 

Explicit or covert contrasting of informal channels of 
scientific communication to formal ones reveals the meta- 
physical «sin» of many contemporary scientists and en- 
gineers who, while paying lip^^service to the dialectical 
principle of system approach to natural, social or mental 
phenomena, actualy yield to the temptation of simplifying 
their task by searching only for a single solution, which 
like a magic number or formula, o «philosophers' stone», 
Archimedean poin of support, etc. could guarantee the 
attainment of their goal under any circumstances. 

In this they sometimes refer to V. I. Lenin's words who 
wrote: <cThe whole art o( politics lies in finding and taking 
as firm a grip as we can of the link that is least likely to 
be struck from our hands, the one that is most important 
at 'the given moment, the one that most of all quarantees 
its possessor the possession of the whole chain».^*'^ 

As is evident from the quotation, V. I. Lenin spoke of .a 
concrete approach to phenomena; of the finding of the link 

-•^'V. I. Leniii. Coll. works, Vol 5. Moscow, Progress Publishers, 
p. 502 
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wludi is (he most imporlaiil one under the given condi- 
tions It needs not necessarily be as iiiiporhuit under dif- 
ferent conditions. Tluis, this injunction t^v Lenin is of a 
methodological nature and it does not concern the struc- 
ture or the essence of things and phetiomen;!. 

We do not possess a siifl%e'nt iinioiint of reliable data 
to question the assertions of the highly esteemed scientists 
(D. J. de S. Price, W. D. Garvey, B. C. Griffith, H Men- 
7.cl and others) who maintain that at present more' scien- 
tific inlormation is transferred through informal than for- 
mal channels, and that such information is more. valuable 
for science. Moreover, we are ready lo accept such asser- 
tions, although they are based rather on intuition than on 
scientific evidence. Tha't docs not, however, grant (he con- 
clusion (hat at present informal communication channels 
play a more important role in science than formal ones 
Such a conclusion might be particularly dangerous when 
the administration of science and Hs financing arc fully or 
to a great extent exercised by the State. 

Being interested in obtaining a ma.ximum «scien(ific 
ou^tput» to justify huge sums allocated for science, the 
State, with the help of financial and other measures, stri- 
ves to create conditions which, according to its official ex- 
perts, ensure the bes'l possible facilities for effective rese- 
arch and development. 

Inadvertent — and utterly wrong— assertions of an un- 
conditional primacy of informal channels of scientific com- 
munication over formal ones may lead (if they have not al- 
ready led) to a cut-down of allocations for publishing sci- 
entific and technical literature and thus may cause seri- 
ous damage to the development of science. 

Whatever the quality of informal char.nels of scientific 
c/jnimunication, ^their personal nature and spatial limita- 
tion.'? preclude their being capable of ensuring the genuine- 
ly international character and historic continuity of scien- 
ce. These are two of the many functions of scientific and 
technicai literature which in the social mechanism of sci- 
ence is, at least, as important as informal channels. 

Thus informal channels of scientific communication sho- 
uld be studied in their organic unity with formal ones, 
so to, speak, in vivo. These studies should deal not only 
wi'th the specific features, and properties of informal chan- 
nels but also with the mechanisms of interaction between 



informal and formal channels. 11 is these mechanisms that 
are among the least studied aspects of scientific communi- 
cation, although they are likely to hide great potentials of 
boosting the efficiency of scientific information transfer 
. and use. 

There is one more remark. A. Merta asserts that tiie 
system of scientific communication is a closed one. We 
cannot agree \vith this, for we maintain that the scientific 
communication system, just as all or almost all social sy- 
stems, is an opssi system. The hierarchical nature of its 
structure is the means ensuring the preservation' of _ the 
system's integral nature (closed systems preserve their in- 
tegrality due to a feedback with the environm :"t). We be- 
lieve that the hierarchical structure of social systems ex- 
plains the amazing circumstance 'that Zipf's rank distri- 
iDutions can be constructed for the elements of many dil- 
ferent social systems. Bradford's law of scattering descri- 
bes one of such distributions relating to scientific and tech- 
nical literature. 

The paper «Some Sociological Aspects of Formal Sy- 
stems for the Communication of Knowledge», contributed 
by D. Foskett (UK), is a brief review of the development 
of formal methods and means of scientific communication 
in their totality, which came te be more or less fully deno- 
ted by the notion of «scienti.fic and technical literature». The 
paper reflects great scientific erudition of the author, his 
rich practical experience, profound understanding of the 
subject, and the ability to express hi? ideas in a clear-cut 
and perfect style. I am sure that the readers will share with 
me the great interest and pleasure that I felt each time w^hen 
reading D. Fosketfs paper in compiling and editing 
this volume. 

One of the ways 'to overcome the state of crisis that has 
been taking shape in the sphere of scientific and technical 
literature because of the rapid growth of the number and 
; size of periodicals is, according to D. Foskett, the so-cal- 
led «Bernal plan». It will be recalled 'that the plan w^as 
proposed bv J. Bernal at the 1948 London Conference on 
scientific information and consisted in the following: the 
practice .of publishing articles in scientific and technical 
journals should! be completely or partially abandoned, and 
manuscripts of'-'these articles should be deposited in speci- 
al information centres. The duty of such centres should oe 



lO promptly notify scientists and specialists of all the in- 
coming manuscripts which may be of interest to them and 
to produce copies of these manuscripts on request. 

It should be noted that similar proposals had been 
made earlier (for instance, in 1933 by Soviet delegates to 
the International Geological Congress and also by Wat- 
son Davis). But it is thanks to J. Bernal that the propo- 
sal became known to broad groups of scientists and spe- 
cialists and, tiiorefore, it was justly called the «Bernal 
plan». 

Now there are possibilities for technical implementa- 
tion of the plan. Moreover, to an extent it is being brought 
into effect in the USSR. But the experience in deposition 
of manuscripts shows that if these are not «sifted» 
through a preliminary reviewing, this is bound to result 
in lower quality of scientific and technical literature and 
its devaluation which will be harmful to the general pro- 
gress of science. On the other hand, if such manuscripts 
are preliminarily reviewed, will this not sacrifice the main 
advantage of depositing documents before publication—- 
the reduction of time necessary to bring scientific infor- 
mation to the user's notice? Will the organization of a 
system for reviewing deposited manuscripts (naturally 
if the. reviewing is done as thoroughly as that of manu- 
scripts to be published) not lead to the emergence of te- 
chnical structures more or less equivalent to editorial 
boards of journals? These and many other questions sho- 
uld be given a more or less clear answer before accepting 
or rejecting the «Bernal plan». 

We believe that the stand taken by some periodical 
publications is morc'.promisliig. They carry only abstracts 
of (all or most) articles while the full texts of these — 
in the form of separate off-prints or copies on micro- 
fiches— are disseminated on readers' request. 

D. Foskett justly criticizes the «engineering approach)) 
'to problems of scientific communication which virtually 
disregards social aspects of these problems re- 
ducing these to the category of common mass- 
.production problems. The author also warns aga- 
inst the dangers that attach to the fact that in 
capitalist countries commercial publishers exercise 
an ever tighter control over publication and dis- 
^ seminarion of scientific and technical literature. It leads 
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not only to a rapid rise of prices for scientific and tcdmi- 
cal literature (during the past" 5 years they have fi:onc up 
by 10 — 15 per cent per annum) but aiso to a dcf^Tadntion 
of its scientific standards. To sum up, there are hardly 
any reasons to doubt that scientific and technical literatu- 
re — periodical publications, books, patent specifications, 
etc. — still is (and will long continue to be) the priiuipal 
medium of communication in science,, The particular ma- 
terial form in which scientific and technical publications 
will be issued, say, in some 10 or 15 years from now — 
whether as conventional bound books or on microfiches, 
or in. some other form — is in this case irrelevant. What 
is essential is that the institution of reviewers be kept 
intact and operational. We believe the latter to be an im- 
portant attribute of scientific and technical literature vie- 
wed as a social system, for it keeps the scientific quality 
standard of publications at a certain level achieved by 
science at the present stage of its development. 

Therefore, the identification and study of the proper- 
tics and regularities of scientific and technical literature 
are of very great theoretical and practical importance, as 
this opens up broad prospects for raising the efficiency 
of its function of scientific information transfer (disse- 
mination). 

Another very important property of scientific and te- 
chnical information is aging. Aging of scientific and 
technical literature in this case means a decrease in 
intensity (frequency) of the use of publications by scien- 
tists and specialists with the increase in the time which 
elapsed since the date of publication. To measure the 
obsolescence rate, R. E. Burton and R. E. Kebler proposed 
to use the so-called «half — life» period (i. e. the time 
during which a half of the publications of a certain set 
ceases to be- used). This period is obviously different for 
various sciences and disciplines as well as for various 
types of publications. The knowledge of the regularities 
of the obsolescence of scientific and technical literature 
makes it possible to work out rational acquisition policies 
of libraries (to be more exact, of systems of libraries), 
to optimize the file sizes of documentary information re- 
trieval systems (IRS), etc. 

The paper by B. Brooks (UK) who has successfully 
studied various properties of scientific and technical li- 
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teratiirc diirinn: the past low years, is dcvoled io Ihc pro- 
blem 01 aging;. ' 

This work excels in clarity and depth due lo the tact 
h.l 15. Brooks has a perfect command of methods of 
mathematical statistics and uses them sidllullv They in- 
variably evoke great inlerosl miiong all specialists in^ in- 
formation science. 

The article by J. Wanke (Czechoslovakia) dea's v/ith 
Mich parameters of documentary and management IRS 
as he frequency of the use of the file, the average stora- 
ge tmie of a record in the JRS, the avcrai^e recoid lenWh 
and the average value of a record - (a docun^ent) The 
ail hor shows that the third-generation computers ha vin- 
a high-capacity storage with random access are thus fa" 
more siiil.;uie for the implemeniation of management 
IKS, wmch are chararferized by a high file call frequencv 
a comparatively short storage time of records, a shorter 
record cng lwind a high value of records. According (o 
J. Wanke it is due to these properties that the manapc- 
ment iRS are developing faster than the documentary 



However when examining the reasons of these pheno- 
mena one should also bear in- mind other important fac- 
tors Proin a semantic point of view the search of docu- 
ments ,s by far more difficult: than the search of business 
mforma icn, since in the latter case there are no problems 
of an adequate linguistic expression in an information re- 
quest of the actual information need; there are no prob- 
lems oi expressing the main subject of the document in 
lis search profile or of establishing the criterion of rele- 
vancy between the document profile and the information 
request This is likely to explain (at least, to some ex- 
tent) ihe fact that users of centralized information sys- 
ems strive to develop their own IRS, which enable them 
to take a better account of the specific natur<=^ of their 
information needs and achieve better retrieval results. 

Besides, documentary IRS do not generally yield the 
same direct and tangible economic effect as management 
J Kb lor instance, ihe computer-based system for bookiu'^ 
air tickets. Last but not least, public practice demands a 
much greater number of management IRS than documen- 
tary ones. 
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R. Fairthorne (UK) has written for this collection tlio 
paper «Problems of Diita Retrieval and Dependent Techni- 
ques». R. Fairthorne considers data retrieval a special ca- 
se of document retricvnl, as the former seeks to find parts 
of documents (texts), and dociunent retrieval — complete 
documents. That this viewpoint is well-groimded l)Gcoines 
particularly clear if one recall:, the aml)iguily of the no- 
lion of «docimient». 

It is not difficult to give examples showing 'that under 
different conditions one and tlic same text can he conside- 
red either a separate documcnl or a part of a larger do- 
cument. In case of data retrieval the user gets texts which 
contain the answer, or, which is more typical, from which 
the answer to the given question can be logically derived. 
The finding of this answer (or logically deducing it) is 
the task of the user himself. 

R. Fairthorne points out that the estimation of the 
truth or falsity of statements, verification of facts and 
thoir use are beyond the scope of information science. 
This should be always minded by those militant infor- 
mation scien'lists who' strive to include into the subject 
scope of «information science» as a scientific discipline the 
content analysis and synthesis of information. These ope- 
rations are impossible without a qualitative evaluation of 
information, and the latter presupposes the knowledge and 
use of tiieories and methods of relevant sciences and dis- 
ciplines. ' C I 

Thus, data retrieval is a sort of contniuation of docu- 
ment retrieval: at first the documents are retrieved that 
mi"-ht contain parts of the text with the data needed and 
theli — these particular parts of the text. There is nothing 
peculiar about data retrieval that would suhstantially 
distinguish it from document retrieval in scientific, metho- 
dological or engineering terms. Whether data retrievar(as 
well as document retrieval) is used in natural sciences or 
in the Humanities does not add to it any specific character 
In hi;:, paper R. Fairthorne examines most important 
and pressing questions which today preoccupy many in- 
formation scientists. We are confident that this thorough 
and pithy article will be appreciated by the readers. 

The paper «A Condensed Heuristic Documentation Sy- 
stem» by R. Lerlercq (lielgium) and C. Rixon (UK) de- 
velops the concept of the .language .and logic of a tor- 
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malized logical informa'tiori system designed for uniform 
reduction of scientific information as well as for making 
logical inferences from it and for evaluating new infor- 
mation, not explicitly contained in the input texts (docu- 
ments). ^ 

In this connection we should like to note that we beh'- 
eve that the study and development of logical information 
systems in the strict sense of the term belongs not to in- 
forma-tion science but rather to the special sciences and 
scientific disciplines concerned, for any logical information 
system is founded on a wide use of a system of concepts 
logic, theories and methods of a certain particular science 
or scientific discipline in deriving and estimating new con- 
cepts and obtaining new scientific informa'tion. Informa- 
tion science as a scientific discipline does not deal with 
the content-l:>ased processing of scientific information. 

Likewise, mathematical logi", for instance, does not 
intrude upon those fields which serve as substantial in- 
terpretations and models of formal logical systems (cal- 
culi) it studies. It is evident that information workers who 
are specialists in relevant sciences and scientifc discipli- 
nes, carry out a content-based processing of scientific in- 
tormation when they abstract, annotate and index texts 
(documents), translate them into another language, wri- 
te surveys, etc. 

J. Halkin (Belgium) has contributed the paper «Pro- 
posal and Wishes for an Open Structure in the Communi- 
cation of Information». This extremely interesting paper 
concerns many important issues of modern information 
science and scientific information activities. Among these 
issues there are some on which we should like 'to express 
our views. 

J. Halkin considers that at present one of the principal 
ways of raising the efficiency of scientific information 
service given o scientists and specialists s decentraliza- 
'tion of the processing of documentary information and 
centralization of its transfer (dissemination). In other 
words, analytical and synthetic processing of scientific 
documents (i. e. their abstracting, indexing, etc.) should 
be done by specialists in narrow fields of science and tech- 
nology. The results of this processing- are transferred to 
relevant information centres which disseminate scien'tific 
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information by issuing socondary publications, magnetic 
tapes with indexed document files, etc. 

If I underst and J. Halkin right, such a demand — 
being certainly corredt in essence — may cause some be- 
wilderment. For it is precisely the way in which all more 
or less big information centres of the world really work — 
VINITI, Chemical Abstracts Service and Bioscience In- 
formation Service of BioJogical Abstracts (USA), Centre 
de Documenta'tion du C. N. R. S. (France), Excerpta Me- 
dica Foundation (Netherlands), and others. 

At present, over 23,000 part-time specialists work at 
VINITI, many of them living not in Moscow but in othei 
towns of the country. It is these narrow specialists that 
carry out (in a decentralized way I) the content-based pro- 
cessing of scientific- information while VINITI is engaged 
in centralized Fulfilment of all those functions wliich 
J. Halkin unites under the common title of «information 
'transfer». 

Perhaps, J. Halkin means the necessity of preventing 
wide duplication of analytical and synthetic processing of 
the same documents for different branches of science and 
national economy. But such duplication can largely be ac- 
coun'ted for by technological reasons and can only be eli- 
minated as a result of wide application in information 
practice of computer-based integrated information sys- 
tems (the papers by A. L Chernyi and P. Vinken are de- 
voted to this subject). Naturally, the use of computers for 
the processing of scien'tific information and for the prepa- 
ration of secondary publications entails a considerable 
change in the structure of the networks of information ser- 
vices. If this is what J. Halkin means, then we have no po- 
in't to argue about. 

J, Halkin touches upon one more question of principle 
which cannot be left without comments. J. Halkin shares 
with some scholars the view that scientific informa- 
tion is a commodity, to which we cannot possibly agree. 

What are the attributes on the basis of which scienti- 
fic information is regarded as a commodity? Such attribu- 
tes are likely to be the following: 

1. Scientific information is 'the product of labour. 

2. Scientific information is bought and sold, hence the 
conclusion that it has value and a price. 

Er|c^ 13 
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Let lis compare these atlrihutos wiili iho mn^f J...^^ 

A coMiinodily is tlie producl of labour whicJi 
— meets a certain. human need- 
fo-7o/e. for one's o^vn consun.plion but 

Scientific inronn;,!ioM meets hunum needs in knowlecl- 

«'.'iel- Is ' use'vn'ir ?>;'=''>' P'"'^"'''"^' "^'^^"^ 
,„i i-V i use-valiie is a commodi v. «7o l>erom" a cnm 

ic liansfenod to anotlicr wlioni it will serve as a use-va- 
lue, Ijy means of an exchanges.*' However scientific in 
formation is generally obtained not for sale ''^'^"^"'^ 

II one sees only tlie e.Nternal aspect of phenomena the- 
le arises an illusion that scientific information s riso m 
objec of hucljstering. But. what is in fact bought or so d 
lb. not scientific information proper, but //le riehl L 
cerlaui kinds of scienlific mfinnation for L Tr pose of 
gaining prof I. Moreover, only the right to ujl new lect 
nolog'ca information, and not any scientific informatio 

fnform.,1^on^ , '"l''' ^°\'^\ The supreme form of scien ,f c 
inlormat on— the knowledge of the laws of naturo so'-ietv 
and think,ng---cannot be an object of property a[ all- thev 
are not patented in any country of the world%,nd are no^ 
□ougnt or sold. 

Tile same is also true of scientific information dissen.i- 
nated through scientific literature, i. e. of the bulk of sci- 
entific information (though the carriers of scientific infor- 
mation—books, journals, copies of articles from them, pa- 
tent specifications, etc.) are bought and sold. However 
prices for scientific and technical publications obviousiv 
are not determined by the value of scientific information 
they contain: books by A. Einstein, N. Bohr, J. von Neu- 
mann and L. Landau are sold at the same price as compi- 
lations by unknown authors, and any specification to a 
U.b. patent^ regardless of the value of the patented inven- 
tion, costs t)0 cents. 

Even if any kind of scientific pubhcation could be bo- 
ugl-it and sold that would not be sufficient to call scienti- 
tic information a commodity. For instance, works of art are 

p aCvqI^T^^^' ^^^"s^"''>w, Foreign Lnnguascs Publlshinfr House. - 
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bought and sold but llioy arc cen\ainly not commodities. 
Naturally we are speaking in this case oT the originals of 
\vorks of art and not of their copic's. 

In his book «The Human Use of Human Beings» 
(1954) N. Wieiier stressed that mercantile attitbje to in- 
f«Drmation as a commodity, according to a standard Ame- 
rican criterion^ leads to the nii^iundcrstanding of inforana- 
tion and its associated coiicepis. N. Wiener wrote: «\V'hat 
makes a thing a good commedity? Essentially that it can 
pass from hand to hand witli the SLd:>stantial retention of 
its value, and that tlie pieces of this commodity should 
combine additively in the s^ime way as the money paid for 
them. The power to conserve itself is a very convenient 
property for a good comniodity to liave <. . .>. 

Information, on the oilier hand, cannot be conserved as 
easily, for. . . the amount of information communicated is 
related to the non-additive quantity known as entropy and 
differs from it by its algebraic sign and a possible nume- 
rical factor. Just as entropy tends to increase spontaneo- 
usly, in a closed system, so information tends to decrease; 
just as entropy is a measure of disorder, so information is 
a rneasure of order. Information and entrop>' are not con- 
served, and are equally unsuited to feeing commodities. 

And further on: «The idea that information can be sto- 
red in a changing world without an overwhelming depre- 
ciation in its value is fa I se.» 

.It is not appropriate here to dwell in greater detail on 
the question of wliy scientific information is not a com- 
modity. We have done it elsewhere. We should like to 
emphasize, however, that declaration of scientific infor- 
mation, a commodity inevitably brings into the scientific 
information activities a commercial spirit which is utterly 
inacceptable to all thinking people and which is having 
the same detrimental influence on it as on similar f-ields' 
of human endeavour — on science, education, art and pub- 
lic health. 

The paper «The Future of Access (Abstracting - and 
Indexing) Services» contributed by D. B, Baker (USy\), 
P. V. Parkins (USA) and J. Poyen (France) is of great 



*) N. Wiener. The Human Use of Human Beings. London, Eyre and 
Spottiswoode, 1954, p. 1 16 
**J ibid., p. 120 
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IC'ccast \Z: fli ',T'*' "'t --easons. The authors 

lo,ccat,t (ho development and improvement of sccondarv 
pubhca ,ons whicli serve and will long continue to serve 

"i™hlo'"' ^P'''^'^"-^'-^ '^"^ '"ain^guide through he 
r?t, Th f"""" °J '^'^'^"tific and technical lite- 

lature. Therefore, the elaboration of the forecast of Ihr- 
future development of this important f^eld of scienti e 
and uiforma ion activities is of tremendous tlSical 

f resl'^nt'T',^'^" ^'^"^ P^P^^ particular y h,! 

teresting as its authors are well-known experts of e-reat 

andTc4ntml -'T°"'"r'" ""^'^ °' inforj^rtion scK 

^ncJ scientific information activities 

Since 1958. D. B. Baker has headed the Chemical \h 
stracts Service (CAS) which during the I a st^ 2 years 
has developed and introduced into practice quite a few 

fcd"rot'ssin.°oV'^ /'-^f ^"^ methods 'of'' automT 

lea piocessing of . chemical information. Since 1907 CAS 

has issued the «Chemical Abstracts» - the biJs^st abst 
ract journal on chemistry and chemical engineering pub- 
shed in English and known to every chemist In 1Q71 
his journal published 350105 abstracts ofTucles U'- 
Icn specifications, scientific and technical p ogress re- 
por s and other scientific documc.iis, which ^s f 13 per 
cent increase as compared with 1970. Since its start the 
<<Chemical Abstracts, has published a total o over 5 m 1- 
hon abstracts. Besides, CAS issues a number of other se- 
condary publications. CAS has also developed and put 
into operation a computer-based system of searching or 
chemical compounds - the so-called Chemical ^ ry 
System m which the names and structures of some 2 mil- 
1Q79 Af^"''"^ ""'I'-'^'^i^/J substances had been recorded by 

wnHdlf^''''^T-^' °f "i^st automated 

woild information centres. Obviously the great experien^ 
ce accumulated by CAS during the 65 years of itrexTs- 
tence has been reflected in the paper. 

T r^' P^i'yns has been the director of the Bioscience 
i!n^''"iQri'"T^ ^^J^^'^^ Of Biological Abstracts (BIOSIS) 
H nJ^ i i^- ^^Smmng from 1928 BIOSIS has issued a lea- 
ding biological abstract journal in English — «BiologicaI 

i^n^ '"'.-^^ ^^Z^ P''^ published 140,020 ^abs- 

tracts of articles, books and other scientific documents Be- 
sides, in 1971 the «BioResearch Index» carried information 
on another 100,000 progress reports on research in biology 



During the last 0 years BIOSIS has scored great sue* 
cesses in the development and improvement of their se- 
condary publications and services as well as in the auto- 
mation of the technological processes used by it. 

Thus, P. V. Parkins' ideas of the future of abstracting 
and indexing services are leased on her great practical 
experience and the results of fundamental research. 

The third author of the paper «The Futurii; of Access 
(Abstracting and Indexing) Services» is J. Poycn who up 
to 1963 had been engaged in developing automated infor- 
mation retrieval systems for Compagnie des Machines 
Bull. Since 1963 she has been Secretary-General of the 
Abstracting Board of the International Council of Scien- 
tific Unions (ICSU AB). This international organization 
was founded in 1952. At present, its members are 13 ma- 
jor abstracting services of the world including VINITI's 
Abstract Journal (USSR);. «Chemical Abstracts», «Biolo- 
gical Abstracts» and «Engineering Index» (USA); «Bul- 
letin Signaletique» (France); «Physics Abstracts*, «Elect- 
rical and Electronics Engineering Abstracts* and «Com- 
puter and Control Abstracts* (UK) and others. In recent 
years ICSU AB has carried out a number of research 
projects aimed at developing and improving abstract jo- 
urnals in major branches of science, and has played an 
important role in tmdertaking a feasibility study on the 
establishment of a World Science Information System 
(UNISIST). Therefore, J. Poyen's thoughts on the ways 
of further development of abstract journals are worthy 
of great attention. 

Generally, we share the views on the future of abst- 
racting and indexing services set forth in the paper by 
D. B. Baker, P. V. Parkins and J. Poyen. But we should 
like, together with these authors, to emphasize once aga- 
in that the introduction of computers into the practice of 
scientific information activities entails a radical revision 
of all traditional concepts, a partial or even complete 
rejection of the existing stereotypes of technological thin- 
king. One can sometimes observe attempts at using com- 
puters to automate the same technological processes that 
had been practised in the conditions of manual or mecha- 
nized labour. This is a capital error. Such attempts can 
be compared with the behaviour of an airline manager 
who would so chart the route of planes carrying loads 
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and passengers in a mountain area alon^ the motor roads 
running through that area that it woiUd follow cxactlv 
every meander of these roiuls. ' 

The real value of the computer is not that it permits 
to replace manual work by machine operations and to 
greatly speed up the performance of tliesc. The computer 
provides prefcctiy new opportunities that a re .unthinkab- 
e with manual or mechanized labour. As J. Diebold no- 
ted, automation should be viewed as a phenomenon can- 
smg a change of the basic principles of production. Auto- 
mation enables the whole production process to be perfor- 
med as an integrated system, from the input of raw ma- 
terials up to the packaging of the product. He wrote that 
automation is «more than anything else a concept or a 
way of approach in solving problems and that it marks a 
considerable departure from many accepted practices of 
management».=''^ And further on; «Automation is not a par- 
ticular group of new machines. It is a new concept — the 
Idea of the self-regulating systems — and a new set of 
principles. Only when management understands this will 
it gain tiie full benefit from automation».^'='*=-' 

Automation demands a complete re-concepiitalization: 
LtI-^ ^ "^^^ ^^^^ thinking which permits to avoid the 
difficulties arising in using the traditional methods of 
problem solving, and to approach the problem from a 
new side. This approach should be adopted in designing 
any kind of system, including an information one. 

The paper A. J. Chernyi (USSR), «Integrated Infor- 
mation Systems», defines the concept of an integrated 
information system, discusses the major factors and pre- 
mises which led to the emergence of the systems of that 
type, and gives their generalized functional scheme. An 
overall description is made of the general approach to 
the creation of such a system on the basis of VINITI, the 
current status of its development, and the author's ideas 
on the possible structure of a nationwide system of sci- 
entific and technical information in the USSR to be based 
on a network of interacting integrated information sys- 
tems, «information banks» and scientific document repro- 
duction centres! 



^ I^'«2bo|fl. Beyond Automation. New York, Mc Graw-Hill BooU 

Co., 1964, p. 54. 

** Ibid., p. 55. 
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Tlie noxi paper by P. Vinkcn describes nn intcgraicd 

inrorninlion system v/liich is already operational the 

ciutt mated medical information system that was develo- 
ped at the E.xcerpta iMedica Foundation (Netherlands) in 
19G8, The author is Director (for Research) of the Exccrp- 
icii Medica Foundation. Mis paper is, therefore, perfectly 
c(,inpctent on the subject and! is extremely interestinc^, 
partiv-Lilarly m vrew of= the sc'ircity of published factual 
data on this — in ;i way, uniqiuc — computer-based infor- 
miation system. 

Tlk; l>ook ends up witii tlie paper by \V, Piroj?^ Direc- 
tor uF the Cenifiral Institete of Scientific, TechnTcal and 
EconoKMc Inr(o;?nT]ation of Poland (Warsaw), entitled «Tbe 
Use of Compters in Sciientific, Technical and Economic 
IiTform-^alion vn Poland». The paper surveys the major 
accompJishmenits of Polish speciia lists concerned with the 
inlroduciion of computer ifacilitiies into scientific and te- 
chnical! infoiriaf^itition worilk. It contains much interesting 
factual infonniiiation unlikely to be known to the broad 
aodience of iinformation scientists and officers outsido 
Poland. The paper affoirds an insight into the basic appro- 
aches of PoJi>h specialists to solving this key problem 
which makes ii especiallly valuable. 

So much ifor the coKteTits of the collection. In conclu- 
sion, w^e shooildi like to express the hope that this book 
will contribute to defuning more clearly the subject of 
information sc'ence anid its main objects of research, to 
identifying the key problems facing information science 
and, in this wsy, to promoting its shaping as a scientific 
discipline in iiillis own right. This is the main objective of 
the FID RseaTch Committee «Theoretical Basis of Infor- 
mation» 



A. I. Ckeniyi 



p. Afanasiu 

(Romania) 

STUDIES OF INFORMATION NEEDS 
1. Causes that led to the study of information needs 

The information system acts as an optimization relay 
in transmi'tting linowledge from information sources to 
users. The major objective of the whole complex of com- 
ponents and activities characterizing 'this system .is to 
provide the information required at 'the right time, in an 
■ assimilable amount and in a readily accessible form. Gi- 
ven ail this, one might wonder why stiidies of these ne- 
eds and requirements, underlying any information sys- 
tem, began to develop as late as 1948 111 and weh't on 
concentrating almost exclusively in one country until th? ; 
1960's. This was most certainly due to the difficulties en- 
countered in such studies. The numerical growth of users 
as creators of information, coupled with the proportional j 
increase in the bulk of document calls for such huge • ! 
efforts in terms of highly-qualified iTi,anpower consump- 
tion and equipment costs that the question of looking in- 
to the needs of information has lately become the focal ; 
point of modern information science everywhere. Any me- 
chanized information system requires, before being laun- . 
ched, the acceptance of its prospective users, i. e. the 
assurance tha't it will be used and will operate on a pro- ; 
fitable basis 12, 3]. 

Using available data from literature and over own 
experience in this field, we will try to outline the major 
factors inherent in any attempt at an information needs 
analysis in terms of sources, objectives, working methods 
and techniques as well as prospects. 

2. Data-gathering sources 

As information creator and user, the specialist, wha- . 

o tever his scale or field of in'terest, is the main source of 
ERIC 



data to be taken heed of when evaluating the needs of 
information, or, to put it more plainly, the requiremen'ts 
vis-a-vis the information system. 

The diversity and subjectivity of 'the users' viewpoints 
and of the factors acting upon the needs and requirements 
of information made some authors [4] to ques'tion the 
utility of users' needs investigation in information system 
design and others to consider ithe task of defining what 
information needs amount to as a painstaking or even 
■ impossible venture [5, 61. On this line, there are some 
supporting the view 'that information is a professional 
service which must impose on the user what it deems 
useful [71 as is the case with the medical sciences. 

Still in its early s'tages, the information science has 
not yet accumulated a sufficient amount of experience and 
virified data to be in a position to establish, with the re- 
quired certainty, what can be of topical relevance in the 
creative process out of all recorded knowledge. An analy- 
sis of !the needs of information users could make it pos- 
sible to thrash out in the foreseeble future some objecti- 
ve and 'definite criteria to help in assessing beforehand 
the relevant pa'ttern of information for each separate 
case. 

•Deprived as it is from physical attributes, informati- 
ons are liable to such subjective interpreta'tions that the 
differentiation between information requirements goes 
down right to the individual user 18, 9, 101. It goes with- 
i out saying that no present-day information system can 
^ be attuned to individual requirements and 'this holds true 
for the future as well. 

Modrn scientific developments by team work and - 
the identical characteris'tics of larger user communities 
make it possible for the requirements vis-a-vis the in- 
formation services to be controlled a't a fairly general 
level 19, 11, 121. These average needs and requirements 
must be taken into account in any user studies. 

The users' overt reticence and passivity about the in- 
\ formation system and most particularly about any break- 
' through in this field is common knowledge '112, 13, 141. 
The user should be directly involved in the informa'tion 
system operation in order to adapt himself to the techni- 
calities of the user — system interadtion. Getting to know 
o^^ie economics of the information system the user will 
ERIC 



succeed in gauging its advantages and will better under- 
s'tancl hjs duties when using the system 

^ Users' viewpoints reflect in all cases habits, practices 
wishes and needs in proportions hard to determine the 
techniques employed in evaluations beint? often rather 
uncertain [7J. 

la spiite of all this, the information system must be 
very sensitive to the users^ appreciations 'whatever their 
rate of uncertainity if the user is to resort to 'the infor- 
mation system 13, 6, 8, 9, 161. User studies must be ori- 
ented towards obtaining value judgements and determi- 
ning some hierarchizations liable to interpretation 117] 
which should supply the system with the necessary mi- 
idehnes. ^ ^ 

Data obtained from 'the «user» source must not he 
aken isolated. They are completed by and correlated with 
those resulting from (he system's technical and material 
capabi I'ty as well as from the goals and objectives of 
the institutions served by the system '[181. 

3. Goals to be attained in the study 
of information needs 

^ The specialist in its capacity as creator and user of 
informa'tion and also as main source of data presents se- 
veral aspects which are often rather difficult to isolate. 

Having a marked psychological nature two categori- 
es of interests are meit with in all user studies: those re- 
ferring to analysis of the creative process proper (where 
aspects of the intervention of recorded knowledge are su- 
re to be present) and those regarding the information 
How and its implications. The first category constitutes 
one of the main objects of s'tudy of the science of science 
with corresponding implications for the transfer of infor- 
mation, the second represents the principal objec't of stu- 
dy in the field of user information needs [19]. 

Altuned to the problems likely to in'terest or influence 
the information system, the study ' of information needs 
concen'trates on the following: user behaviour in informa- 
tion transfer, channels of information and the ways in 
which-the information flow is put to use fl?, 201 and al- 
so on the fac'tors heaving upon them. ' • ■ . • 



ERIC 22 



Researches relevant to the system have to be oriented 
towards practical problems and designed to gather in ad- 
dition to descriptive data some indications on the functi- 
onal rela'tions between user behaviour and information 
transfer and also some value judgements [17]. 

■ One to the practically unlimited amount and diversity 
of knowledge and to 'the huge number of users and the 
changing complexity of their requirements the informati- 
on system must tackle a wide range of fac'tors in a- per- 
manent dynamic interaction. The question of information 
needs calls for an approach on the complex sys'tems li- 
ne [21]. 

Analysis of the principal operations performed by an 
information system: 1) fields covered; 2) building of hol- 
dings; 3) document selection; 4) eondciising; o) inde- 
xing; 6) cataloguing; 7) storage; 8) retrieval and 
9) display, shows that users' needs bear primarily upon 
the first three and the last two of the above mentioned 
operations [21 , 221 . 

Hence the investigation is orien^ted towards: — estab- 
lishing personal factors, viz. users' characteristics and 
their. influence on the respective processes; 

— de"termining possibilities and limitations inherent in 
the needs of information following the intervention of 
external conditions specific to the social, poli'tical and 
economic system; 

— analysing the user-communication channel interac- 
tion; 

— differen'tiating between the needs of information ac- 
cording to the latter's functions; 

— performance assessment (evaluating performances). 
.Researches conducted so far generally confirm the 

afore-mentioned otojectives (see table. 2) and literature 
['21, 23, 241 spotlight their utility in informatiou system 
optimization. 

4. Methods and techniques employed in the study 

of information needs 

Bearing in mind that the main data provider is- the' 
user himself, the methods employed in such studies, just 
like those in psychology and sociology are based in . the 
^jrsl place on. combining introspection with . observation 



[81. They are recording habits, practices, preferences and 
appreciations and of late (after 1960) the effedts of infor- 
mation activities. Consequently, they are more of a descrip- 
'tive nature. Following recent developments investigations 
of an experimental nature have also come to the fore 
(see table 3) . . ■ ' " 

First studies covering the 1927—1947 period are cent- 
red ex'ckisively on determining tlie value of publications 
required by the need of ^library holdings selection follo- 
wing the information explosion. We may safely say that 
the study of information needs was ini'tiated by J. D Bcr- 
nal in his «Prelimin3ry analysis of pilot-questionnaires 
on the use of scientific literature» presen'ted at the Scien- 
tific Information Conference held in London in 1948 under 
the sponsorship of the Royal Society ['21. 

The rather different working techniques — bibliograp- 
hic reference and library statistics, inquiries, interviews 
dairies, case studies, critical incidence studies, system' 
experimen'tations — are generally based on the forme and 
questionnaire method 18, 21. Due to their limitations these 
techniques are most often combined. 

The data gathered mainly refer to the sources, prac- 
tices and conditions of information. The u'tilization' of the 
new techniques such as the critical incidence (effects of 
some information actions) and the system experimentati- 
on technique are mostly intended 'to obtain indications on 
the uses and results of information actions likily to cont- 
ribute towards defining and objectively differehtiating the 
needs of information and determining more reliable asse- 
ssment criteria than the more use frequency of hierar- 
chization. 

The intervention of modern electronic means in infor- 
mation enhances the tpnssibilities of data processing and 
interpretation in this , field of research [4] and makes it 
possible to continously check on the efficiency of the ser- 
vices rendered in terms of users' reaction. 

5. Results of the study of information needs 

Oriented .towards information system optimization, 
the s'tudy of information needs aims at establishing a 
correlation between what is useful for creation and what 
y-e user wants to get on the one hand and '^he system 



performance and the factors influencing the mentioned 
aspects on the other. 

Despiite the difficulties encountered in such studies 
and despite the fact that their limited number hns not 
made possible, for the time being, a wider approach of 
the whole complex of protalems"=\ the efforts made so far 
have led to results that proved to be of real use to the 
system. This gives hope and justification for con'tinuing 
and improving them. 

On the line of determining the factors that bear upon 
information needs researches have taken in'to account se- 
veral aspects. It has been generally considered that the 
user's activity is the main factor inherent in informaition 
needs modelling. Hence, the following activities have be- 
en listed as involving characteristic requirements: funda- 
men'tal research; applied research; design; ^production; 
education; management and administration il251. The pur- 
pose for which informations are 'being used, relative to 
their function, aithouRh an outstanding factor by its ac- 
tion on the nature of information has been dealt with 
starting from the information channels and by determing 
the functions which are better served by each one of them. 
This differentiation should be made the other way round 
starting from 'the function that must be performed and 
establishing the optimum sources and channels for its 
achievement 'fl7]. For all this a differentiation of the 
functions p'.,rfornied by informa'tion has been reached 
which will make it possible to delve deeper in this field. 

Grouped according to the activities they serve the fol- 
lowing functions come out: a) professional in'terests that 
change with the work stage, i. e. — finding of new ideas 
(selection of the course of action); — orientation towards 
problem approach (work planning); — classification of 
itheoretical aspects; finding of concrete data and solutions 
for .the problems raised while working on the project and 
of informations in related fields relevant to the proiect 
(the working stage proper); — synthetizing and fixing the 
results; b) current awarness as regards achievements and 



' The fields dealt with were mainly <2nginnering, chemistry a'nd 
physics, the focus being placed on fundainental research and in' the 
last deca'de m applied and production research, the samples investiga- 
ted being put to around the one thousand mark [2] 
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In.vtl'' specialized interest; c) orientation 

'towards new disciplmes or problems; d) editing of a .pa- 

fer^ncl 291 ""'"^ '''''''''''' ' 

The field of specialized interest lias an additional in- 
fluence on information needs and requirements depending 
on Its na i,re (w ether it is - itheo[etical or e.perimen^ 
lal), on the novelty and topicality of the siibjec matter 
namely on the rate of development, nature and releva 'e 
of its relations to other specialities. 

As regards the conditions of information, viz the in- 
formation environment, the latter acts according to the 
capabili'ty of coverage of the volume of recorded knowled- 
ge the economic possibilities of acquisition und publica- 
1 on, the character and size of the institution, and the sv- 
s em of activrty organization (by large or small groiV 
P ngs with wider or restricted participation on the part 
of specialists from other fields). ^ 

Thougli of a more subjective nature, the personal fac- 
ors tha mfluence the needs of information have been 
dry extensively surveyed. This was also due to the fact 
that 'the data provided hy the user in this field are far 
more re lab e. The interpretation and correlation of these 
data with the capability of the information sys'tem and 
with other tac ors still lies at the bottom ol the scak' 
Among these factors one can list: professional training, 
qualification, experience, know-how in the subject mat- 
ter and in related fields, age, knowledge of foreign Ian- 

The effects of these factors are manifest at both in- 
dividual and community level. It has been ascertained 
lhat starting from fundamental research toward applied 

cal1?eid.'?n V^f"^r\ '^""'i '^''^'^'^ ^'^^ theoreti- 
cal f elds to the factual and experimental ones brines 

about a more marked dispersion of the works, a sensible 

rise m the role played by the related specialities involved 

n solving the problems, a lower incidence in the direct 

ise of literature and an upward trend towards recourse 

to interniedianes (information services, libraries) and to 

processed materials in a readily accessible and directly 



Appart from the above-mentioned factors these fin- 
dings are also determined. ,b.y.th^.aipount pftime. ayajlab- 
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]e for information that users in 'the respective categories 
have on their hands. 

As regards the characteristics of information channel;^ 
an impressive m.ass of data has been obtained by studies 
conducted so far. The user knows better tliis side of tlie 
problem since he is constantly confronted willi the inflow 
of information intended to put to use ejiisting knowledge 
in his own field o,f interest and with the outflow of diss<^- 
minating his own scientific creation 1301. 

Data recorded so far mainly refer to ^he degree oi 
utilisation of the various categories of information mate- 
rials and to a lesser extend of the information services. 
Evaluations and differentiations in point of aims mainly 
refer 'to primary literature and direct contacts more fami- 
liar to the user than to secondary litei-aturc and informa- 
tion services. - - 

Both literature and face-to-face contacts arc consi- 
dered to be of paramount importance in finding new ide- 
as, identifying subject matter information and obtaining 
concret data and solutions. Literature reigns supreme in 
the field of discipline-oriented current awareness, in 
up-to-dateness in a new problem, in elabortaing a paper, 
conference, etc. 

The utilization degree of classical primary sources 
(periodicals, books) is highly disproportionate as compa- 
red to the less usual and never forms (patents, standards, 
dissertations, technical reports, propectuses) [1, 11], due 
'to external causes (poor accessibility, limited dissemina- 
tion, etc.) rather than to information contents. Consequ- 
ently, data on the :latter category of documents are still 
inconclusive and canno't therefore stand interpretation. 

Existing studies on primary publications provide fair- 
ly reliable data with regard to their importance, viz. .va- 
lue, topicality, lifetime, use frequency [17, 311. Such data 
are e.x.tremely useful for orientation in profile of holdings 
and selection of informa'tion. 

Extensive data have also been gathered as regards the 
deficiencies and the limited degree of use of secondary 
publications. The finding that these are first and fore- 
most the working tools of the informa^tion service led to 
sustained, efforts intended to bring them closer to users' 
requirements. • ' ' • 
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Insignificant use is still being made of the many dif- 
ferent services likely to be rendered by an inform'a'tion 
system. Lack of understanding of such services deprive 
the data obtained of their partinence. What does the user 
want to day from an information service goes far beyond 
the lat'ter's capability. We may safely say that this most 
important aspect of the use of the information channels 
is just being tentatively approached. 

In point of fact the data obtained in this field are ba- 
sed on the use frequency, opinions, evaluations and hie- 
rarchization of the various information channels. Such 
assessments imply a high degree of subjectivism and un- 
certainty.. New measuring methods should be experimented 
as some authors recommend on the line of recording in- 
formation activities and their effects f'l7]. 

Researches oriented towards determining the time 
consumed for information are inconclusive because they 
take the informa'tion process as a whgle and fail to dif- 
ferentiate between the .operations. Such differentiated 
appreciations would be of greater use to the information 
system. 

Conclusions 

The study of information needs constitutes one of the 
crucial problems of modern information science called 
for by the imperative of information transfer optimizati- 
on. Out of the wide spectrum of aspects of this problem, 
those which once solved can favourably influence this 
transfer should receive 'the greatest amount of attention 
on the part of the information scientists. 

Analysis of data sources, of objectives and working 
methods employed in such studies spotlight the comple- 
xity of the problem, (the uncertainty and subjectivity oi 
data, the lack of precision of the methods and the diffi- 
culties in devising objective measuring units. In spite of 
this researches conducted so far have confirmed that in 
some cases, results wi'th a fairly good concordance can 
by obtained. Apart from identifying a' number of factors 
that bear upon the information many data have been es- 
tablished as to the users' behaviour to the various infor- 
mation channels and indica'tions have been obtained re- 
lative to the users' differentiated assessments with regard 
to those factors. 
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Periodic sludies af tliis kind, in unitary working coil- 
ditions, on representa'tive samples, witli a most balanced 
coverage of the fields of activity, disciplines, categories 
of users and geographical areas pursuing improvement 
of working imethods by establishing reliable measuring 

Table I 



Distribution by time periods and by countries 
of user studies identified in literature" 



period 


Niitiibcr of 
works 


U. S. A. 


Br i til ill 


oilier countri-: 


Pcrcentfi^c 
of total 


Number Pcrccntafje 

of of totiil 
works period 


Number of I'ercen- 
nf works f'U'^ o{ to- 
till period 


Niiinbcr Percentage 
ot works of totiil 
period 


1927 


1 0,7 


1 100 






1928-1937 


9 6,2 


8 88 


I 12 




1938—1947 


6 4-,l 


6 100 






1948— i9G7 




39 78 


8 16 


3 0 


1958-1967 


78 54 


49 62,8 


16 20,6 


13 16,6 


Total 


!44 100 


103 71,5 


2E ■ 17,4 


10 11,1 



♦Data are based on a number of W user studies conducted in the 1927— lil;.? 
period and identified by us in litenture 



units and the application of modern com-putation methods 
in in'terpeting the data will lead to valuable results for 
the information system with wide possibilities of correla- 
ting the various aspects of the problem. 

Account will ihave to be taken of the fact 'that most of 
the data obtained generally reflect habits, practices re- 
sulting from given conditions. Experimental methods that 
reveal the effects of a certain type of information system 
on user behaviour towards the -s^y.stem and its performan- 
ce lead to far be'tter resuilts in terms of reliabiifity and ob- 
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j^tivity despite the difficiillics encountered in such stu- 
dios licy should not he ahandoned hut rather expanded 
hy all avaihihle nicans. 

One should constantly bear in mind lhat the needs of 
inlormaljon are in a dynamicaily upward trend within ihe 
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Major topics in user studies 



Sliulivs con- 
thiol olI iJi^lwi'LMi 
. I'X7 ,'iiut IPG? 
in port^L'iilajijos 



Stiulios still 
nmlerwny in 
IIKaS*' in [KT- 



B,4 

0 
2 

24 



22 
3>) 
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I^ecoiirsc io informn(ion transniiiting 
oliaiincis 

Soiiri-os of infortnnlioii; 
]-.v;i]untion of priJiuii v piiblio.-iiioiis- 
iililiznlion 

Studies of priiiijiry publiVvitions 
Sources of informalion idcntifu-iition 
I:valu;ilioii cl abstracts journals 
Evaluation of various inforuiatiou' 
channels 
Information practices 

Recourse to libraries and information 
services 
Personal files 
Conditions of information: 
Time of information 
Requests for bibliograpliic searches 
Exbastivcncss, pertinence 

Porformanccs of information 
Function of information 
Ways of InfornLation (general) 

wVshh,'j.lmrN:uio;rU^ nnd devclonmcnl in scionlific ciocnnonfalio.*, nu. 1.3 
w..!,mnf.ion, i\,iiioii.n Stionoo ]-oiiikI;iIioi) , IflfS, viii, 741 pp. 

user - information system and channels interactio.n, clo- 
sely following the last developments in science and tech- 
nology. This multiph-city and complexity of relations leads 
to a growing interest in the questiom of information n^- 
etls and is at the same time responszilble for the difficul'ti- 
65 encountered in establishing these ;rji!£eds 

Without seei<ing to exhaust this sabject with multiple 
anid diverse implications, the present paper is intended to 
place a somewhat new emphysis on some of its charadte- 
30.1 
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Table 3 



Dynamics of investigation methods and techniques employed 
in user studies (Figures illustratCt in percentages, the proportion 
of the works carried out in the respective decade) 
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Xuinbr of elaborated 
works 


1927 
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34 
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11 
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24 
32 


4 
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22 
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4 
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2 
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2 
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10 


9 


i 

[) 
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50 
78 



ristics in its endeavour to contribute towards clarifying 
a number of aspects and prompting additional interest 
for the study of information needs as an oiitsianding fac- 
tor in modern information systems optimization. 
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A. Meria 
(Czechoslovakia) 
INFORMAL COMMUNICATION IN SCIENCE 

introduction 

In contrast to technology, which has its outlets ope- 
ning into the sphere of practical activities, scientific rese- 
arch flows into the sphere of information. This makes the 
generation of scientific information sources and the moda- 
lities of information transfer a key issue of the present- 
day science. 

Information exchange is natural and necessary to the 
system of science as, indeed, to any social activity of man- 
kind. The organized information flow in science, however, 
differs greatly from its unorganized, stochastic or sponta- 
neous couterpart. We shall deal here only with «sientific 
information», i. e, information that belongs to and circula- 
tes in the system of science as a purposive communication 
field oriented toward creative work and its management. 
We^ thus disregard the well-known fact that the same 
«science information» can as well figure in other commu- 
nication fields, such as journalism, culture and aesthetics, 
administration, politics, etc. 

We proceed from the assumption that a scientist func- 
tions alternately as the generator and as user of scienti- 
fic information, the interactive relations, being very often 
remforced by this constant interchange of functions in 
particular, during personal contacts at scientific meetings, 
conferences, symposia, personal correspondence etc The 
social status of a scientist is also of great importance in 
'.nlormal communication. For instance, even a gifted be- 
ginner has a much smaller chance to get co-opted to the 
so-called «invisibie co]lege» of authorities in his field 
than an academician of nigh renown who has a long list 
of publications in the world's leading journals and single 
volumes to his credit. 
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The Mechanism and Function 
of Scientific Information Circulation 

Creative scientific work is normally divided into three 
lields of endeavour: 

(1) collection of facts and data (by experimentation 
or observation); 

(2) generation of new knowledge (by transformation 
of the data obtained or by abstract deductive techniques); 

(3) information work (gathering, evaluation and app- 
lication of the information obtained by others and com- 
munication of one's own findings to the community). 

It is the third component that creates premises for the 
social usability of the results of science/ This - implies ih^ 
moral and institutional responsibility of the scientist for 
taking a direct or at least indirect part in creating the 
best possible conditions for. the circulation of scientific in- 
formation within the framework of the organization in 
which he is carrying on his research. 

In an earlier paper on the social aspects of scientific 
and technical information communication (I, pp. 54—63), 
the author offered a detailed description of the process of 
generation of new knowledge as conditioned by the cons- 
tant and occasional comparison of one's own creative work 
with the general state of human knowledge. Prerequisite 
to this process is the recording and keeping of informa- 
tion in the appropriate artificial memory. This memory 
may be either individual or group-based, while its crea"- 
tion may take place through a formal or an informal ac- 
tion. Historically the evolution was from informal to in- 
stitutional information systems, that came to play in man- 
kind's social mechanism the roles of either measuring or 
control units, depending on whether they serve creative, 
managerial, productive or distributive functions. 

In the process of generation and transfer of scientific 
information, the scientist fulfils three main functions, that 
form three domains of information activities: 

(J) The informafion generator domain. 

Here, primary and secondary sources (reports and the 
corresponding papers, monographs, patent applications, 
etc.) are generated. These sources are written in the pe- 
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OLiliar objecl-langiKigcs of their authors that materially 
affect the objective aspect of the communications. Informa- 
tion transfer involves a certain conflict between the con- 
ceptual universes and the object languages of the particu- 
lar generators and users of information. It may be neces- 
sary, therefore, to build a common «retrieval language* 
of the formal or informal information system in question 
for tire two communicants and, possibly for their inter- 
mediary information officer as well. This language is used 
in writing appropriately indexed abstracts which form the 
subject condensates of the relevant primary documents. 
Such indexed abstracts are stored in the memory of a for- 
mal OF informal, information system. From the memory 
it is possible to retrieve information relevant to requests 
by any elements of the abstracts. The information systems 
as a rule do the hard job of surmounting the language and 
semantic barriers between the generators and users of 
information. 

(2) The informaiion user (receptor) domain 

Depending on his need, the user puts questions to a 
formal information system or tries to get the information 
he wants through informal channels. The pertinence of the 
answers he receives is a function of the degree of mutual 
understanding attainable through the contact of the dif- 
fering conceptual universes of the two partners in commu- 
nication. 

(3) The information officer (communication 

intermediary) domain 

In the case of indirect information transfer, the diffe- 
reing domains of the two communication partners are joi- 
ned by that of the information officer. As a result, the dif- 
ficulties of the communication process become still more 
aggravated and the pertinence of the information dwind- 
les. These difficulties reff^r mainly to institutionalized in- 
formation systems. 

Forma! and Informal Information 
Flows in Science 

Following Garvey and Griffith [2] and Menzel [3], the 
author has shown in an earlier paper [1] that: 
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(1) the informal (unplanned) communications in sci- 
ence predominates over the planned (instituional) modes; 

(2) Limited access infonnation, both oral and written 
(graphical) is growing in proportion; 

(3) the time intervals between the generation of in- 
formation and its communication to the user are already 
too long and continue to grow. The public (formal) in- 
formation network supplies to its users data 3 to 5 years 
old. And this at a time when the role of the spatial dis- 
tance between the communicants has palpably decreased 
due to the present-day communications facilities; 

(4) a significant regularity can be observed in the in- 
formation flow as regards the transmission channels used 
by the various categories of persons in certain predictab- 
le positions and with predicable priorities. The variables 
that determine the types of channel employed are many 
and depend on the scientists' attitudes, and the require- 
ments of the research institutions of funding agencies. In 
case the existing communication modes are unsatisfacto- 
ry, the information generator or user may create a new 
channel. Unfortunately, a majority of scientists nowadays 
strive to publish their papers in the official scientific jour- 
nals irrespective of whether it is efficient and economical. 
This desire is perfectly natural, in view of the considera- 
tions of personal prestige and the need for a safeguard 
against possible priority conflicts. 

Every information system is a closed social system 
with more or less strong and permanent social norms. The 
relations between its elements are dynamic, that is a 
change of some of the elements normally leads to some 
functional change of the other elements; 

(5) there are three dimensions, or aspects, in the dy- 
namic interrelations of, the information system elements 
(channels) : 

(a) Social dimension. The scientists themselves crea- 
te the channels to meet their needs, if the existing chan- 
nels fall short of it. These new elements affect the exis- 
ting ones through a change in the scientist's behaviour 
during information searching and transfer. This behaviou- 
ral change, in turn, modifies the functions of the rest of 
the transfer media,- and this has a bearing on the other 
main system parameters, for instance, by changing the 
norms. A case in point is the growing exchange of prep- 
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One can characterize the formal and informal ciiannels 
"in the following manner: 

1. The formal channels are generally accessible and. 
possess a broad scope of potential users; the informal ele- 
ments have, as a regular rule, a more limited scope oi 
users. 

2. ' Information disseminated through the formal chan- 
nels can be stored and retrospectively retrieved; in the in- 
formc. ^annels information is often stored but temporari- 
ly so iuiii it is difficult to retrieve it later on. 

3. The formal channels transmit a relatively more 
«outdated» information than the up-to-date information 
transferred through the informal channels. 

4. Information transmitted through the majority of for- 
mal channels is usually in the document form; that of the 
informal channels is documented but rarely and not regu- 
larly, 1 1 ji 

5. The formal channels are influenced mostly by the 
users, whereas the informal information flow is influen- 
ced by the initiator of the communication (the dissemi- 
nator). 

6. Every information system is more or less redun- 
dant, the results cf a research often being transmitted 
through more than one channel simultaneously, diflcring 
by the emphasis laid on this or that particular type of in- 
formation. Redundancy is less with the informal channels, 
which transmit the same information in varying inter- 
pretations. 

7. Interaction is lower with the formal channels than 
with the informal channels. 

Informal information exchange between scientists usu- 
ally occurs in the form of interpersonal communication. 
In comparison with the group and mas5 communications 
through journals, books, and published reports, the inter- 
personal communications pos.sess certain advantages, no- 
tably: ... . . 

1. High speed. A personal communication or preprint 
are ahead of the official publication by a few months or 
even years. It takes three years to publish conference pro- 
ceedings in Czechoslovakia, whereas in some of the Wes- 
tern countries this is done but in a few weeks. 

2. Selective information dissemination. This is particu- 
larly valuable in basic research, ^vhe^e the specialist scien- 



tist needs infommlion covering a wide range of llic al- 
lied fie ds. Tins properly is lacked by all formal biblioff- 
raphicai or mformalion tools, including indexes of all 
kinds Spi systems for a limited scope of patrons may be 
regarded as a kmd ol substltnte, especially if thcv emn- 
oy a conipnter for storage and retrieval and bave "power- 
ful peripheral devices; ' 

3 Selection evaluation and synthesis of the informa- 
tion transmitted. In contrast to a response that the rele- 
rence information collection or some other formal secon- 
dary source or service can give to a scientist, the respon- 
se to a question posed to a colleague has the following ad- 
vantages: ^ 

nn/inl/''% '^''1''^'!^"' 'f'" '"'''^''"y ^'^^^ information and 
not just refer to the relevant documents- 

(b) he will suppress in his answer the documents he 
considers nonrelevant (there remains, however, the risk of 
losing important relevant information, the answer largely 
depending on the individual biases of the person addres- 

(d) in case the questioner is a practicioner, he can ex- 
hnf Jc"^ scientist to communica'te not only information 
tice- ' ^^"cerning its applicability in prac- " 

nn il^ ^^'m P°^^sibility of getting «know-how» informali- 
nl.U '""ih^'- experimental itechniques, instruments, a - 
paratus that is usually ommited in scientific pap.^rs as 
uisigmfican and requiring much space to ■ describe; and 
..nf- 11 ■''^^ feedback in exchange of knowledge, that es- 
sentially accelerates research and makes it more efficient 
and saturated. It is a further advantage of the direc in 
terpersonal communications that the in'teraction hebs 
specify the requests and better to formulate the answers 

Special Role of Casual Information in Science 

Occasionally acquired information continues to play a 
necessary -and ifhus far not formally described - part 
in science. Such unplanned communication can occur both. 
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m interpersonal and group or mass intercourses Leaving 

• fon's' nnlor'''°T' ^-'''T-^'l' ^'^^ physic.^ world (N v^^^ 
on. apple), most scientists will confirm tliat they hap- 
pen to come across excop'tiona lly motivating in onm ior 

cld 'hi'ofMrn^ T^'''''' ^^^'^"^'"^ lecture " : 
cicl lai-off from their own. Menzel (3) no!es the nhlnn 

nienon as yet iinexpJored, that the n o ivationa v-i c of 
mformation is disclosed only in a repeated ? c4 nVo il?o 
sa.me information after a lapse of tin e in a difforen" 

ce"s1;;%"alin"r ^^'^ eT : 

. ccpsary qualification and experience for revealina the des 

n?^in' °^ ^^^tual value o that info : 

mation. This repetition cannot l)e planned cither bv nin 

: user or the information system tha serve h A 'clas 
col examp e is the forgo^tten inventions tl T wte lulv 

: appreciated only years after their first anno .ncemen A 
possible explanation may be that certain innovathe 1 
appear much ahead of. the feasibility of tbeir soln im 
p ementa'tion as granted by the intellectua morni or 

; ot er TmnerfP Hnnf ^nipulses, redundancies' and 

oiner imperJections of the communications systems mav 
; happen to be beneficiaJ to society. systems may 

I /. I'lieradion of formal and informal comniunicalions 
,' channels 

: The two forms of communication have been gradually 

aTd S^lion'nrf '"'^'f'^^"^ ^^'^ '^'^ mechLization 
; and automation of formal communications. This is mani- 

; fested m an essential reduction of the publicaC time 
organized exchange of preprints among the members of 
, .exchange groups, computer-based SDI system sottini 
.. up of special departments for analysis and sym^^^^^^^ 
: specialized discipline-oriented information ffo example 
. the analysis group at the Central Office of Scientific Te 
■ f^l^lho','"^ Economic Information), publishing of ^ids 
textbooks and surveys, programmed texts, etc. All this is 
; washing out the sharp borderlines between the two o,n s 
; of commumcat.on. Premises to this development are ela- 
: boration of a clearcut conception of the discipline conce?- 
. ned and compilation of good special dictionaries CoS 
; witli the technical facilities for remote gathering and 
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'transmission of knowledge, this opens up unprecedented 
prospects for building powerful information systems of 
verbal and non-verbai conununication. 

The seemingly Utopian dream of instantaneons feed- 
back between the communicants has been given the hope 
of coming true in the information systems affording 
«user~-computer» dialogue. The present-day program- 
ming experience attests to the feasibility of machine cont- 
rol and addition of standardized information elements to 
search prescriptions. Naturally, this mechanized modifi- 
cation of the subject specification of a request at every 
retrieval phase is conditional on. algorithmization and' 
progranuiiing of interactive relations between the requ- 
estor and the information system, which has been so far 
a very complicated problem both for the information sci- 
entist and the programmer. Among the few papers dea- 
ling wi'th this subject one can cite a study by Don Swan- 
son [61, 

The efficiency of formal and informal communication 
channels depends on three factors: 

(1) the intellectual content of the communication, cha- 
racterized by the measure of ambiguity of the conceptual 
vocabulary of the field in question and the symbols and 
codes used for communication; 

(2) the social structure of the communication partners. 
Of major importance in this respect is the size of the gro- 
up, the number of sources and the degree of commonness 
of the interests of the communication partners. 

(3) the ideologies of the user group. Basic differences 
will be observed in communication between fundamental 
and «applied» scientists, production engineers, tradesmen, 
etc. As shown by a field study conducted by the author in 
the Czechoslovak Academy of Sciences, significant diffe- 
rences are observed also between the social, natural and 

. technical sciences. 

In basic research the number of communicants is much 
smaller than elsewhere, their scope .of interests is narro- 
wer and 'better definable, which makes for a certain si- 
milarity of the information needs. On the contrary, appli- 
ed research and production users are marked by a greater 
diversity of vocations and greater differences of the in- 
'tellectual standards, which expands the variation range 
of and deepens information requests, While basic rcsear- 
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chers. give preference to informal communication and 
regard tlie evaluation and synthesis of the contents of 
relevant documents as inseparable from their creative 
approach, engineers and other practitioners opt for cen- 
tralized information services and special libraries. In this 
latter group the language barrier is of a greater weight 
(see below for more detail) . 

The development rates of the discipline concerned 
exert a definite impact on *the information p-ocess. Rapid 
personal and physical c]ianges in the structin-e of the 
discipline usually lead to serious communication handi- 
caps. This fact is well known in Czechosiovakia where 
science, engineering and the national economy in general 
. are engaged in a continual reorganization. The transition 
. to the federal state structure and to the new economic 
; management methods is bound to entail profound changes 
^ in the techniques and organization of the system of scien- 
'tific and technical information in the coivnTtry. 

For ail the advantages of mechanization and automa- 
tion in the field of formal communication of special in- 
; formation, all kinds of informal exchange will continue 
; to play an important role, par'ticularly, in the sphere of 
; basic research. The effective organizaticr- of personal con- 
tacts between the eminent and rank and file scientists 
; and equal opportunities of access to i-itirmation for all 
'the users irrespective of their social staius will surely 
much raise the economic efficiency and! efficacy of com- 
: munication at all levels and stages. Forniial communica- 
tions lack the adequate means to assess Ibe relevance of 
external information. Informal communication is «open», 
in that it permits ^to carry on a dialogue with the com- 
municant, selecting the necessary information, including 
even such as the user would be i'mcapable of asking for 
in advance as it is only in the course of the dialogue that 
he lea rns of its existence. These specific needs are chan- 
; ging in the process of the problem solution, depending on 
the nature of this problem, working methods and techni- 
ques he applies, the approach to the description of his own 
results, phase of research, etc. The formal .channels are 
far from always capable of real-time adaptation to the 
speci'fie and direct needs of each particular user. 

The informal channels give the scientist direct sup- 
port or •criticisin on the vague points that occur in his 
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work. This feedback in the dialogue with the o'thcr mem- 
bers of the learned community in one's field contributes 
greatly to the efficiency of the research. 

It is highly expedient sometimes to combine -the two 
types of channel, e. g. to find a source through a farmal 
channel and then to establish direct informal contacts 
with the au'ihor. 

Exchange of preprints has come to be most used in- 
formal channel of late. To say nothing of the difficulty 
of reproduction and circulation of preprints the author 
has to cope wHh, this channel involves certain other 
shortcomings, the major one being that junior research 
staff, the smaller serearch establishments and scientists 
in the smaller and less developed countries, who need 
preprints most of all, have the least chance to obtain 
them (if at all). Therefore, Garvey [2] suggests as a so-' 
lution to the problem of lengthy time lags between com- 
plctjon of a research and its announcemeTit in a journal 
(\~2inj'ijig between 1 and 2 yeans) 'that lists-of manuscripts 
submitted for press be published with the names and add- 
ressiES of the authors. In this way informsl exchange of 
prepirints would be :streamlined to foreshaifiow the actaal 
publicaition of the results in the journal. In Czechoslova- 
kia preprints and reprints constitute a majoE form of in- 
fonraaJ communication in research institatiss^. Unfortuna- 
te k, ike scientific journals in the country fiua-e so far fai- 
led] tto implement this happy idea of GaiB^-y's of publi- 
shiiiig lists of titles and authors of papers submitted, 

Pablishing of conference pi^e-papers among very 
efficacious semi-formal methods. A special bulletin of 
brief communieations is a novelty in the field. This deli- 
vers the official journal for the more important and leng- 
thy publications. 

A eombination cf regular conferences of specialists 
with the timely issuance of pre-papers is the most ef- 
fective method of semi-formal communication, thanks to 
which the participants coming to the conference have a 
more or less exact notion of 'ttie relevant studies so that 
they can tie up personal contacts with the authors, if they 
wish. The speakers can thus concentrate on a discussion 
of their results. The convenors of major in'ternational con- 
ferences always take pains to publish in due time the ma- 
terials for the future conference. 
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Aloney permilting, after the conference collections ol 
the major papers are published 'together with the discus- 
sions A tangible percentage of these papers later on ap- 
pear through the next year or years in scientific journals 
or monographs at home and abroad. 'Ihese publications 
are of particular value to those who did not a'ttend the 
conference and, certainly, to the information agencies and 
'special h'braries that add them to their files. 

We can say in conclusion that, as shown" by studies 
carried out in Czechoslovakia and abroad, it is expedient 
to attack the present information crisis by combining, m 
accordance with the specific :needs of a discipline or re- 
gion the con.ventional formal channels wi'th the inftrnnal 
ways of coniOTunicatiou. Interaction of the two forms is 
bound to promote the permanent improvement of both It 
would be necessary, however, to abandon 'the intuitive 
method and to approach this probiem with the tools oi 
rational empiric and theoretical analysis. All users _ot 
scientific information are eagerly looking forward fe its 
solution. 



Influence of the Language Barrier on Formal 
and Informal Communications in Science 

Prerequisite to effective communication is that the 
svmbolic system employed be understandable to all the 
partners As regards oral communications, this implies 
not only the understandability o^ tfe unit signs, such as 
the phonemes and letters of tka common alphabet, but 
that of words, phrases md larger text segments. Other- 
wise, in case each author uses a different national lan- 
guage to report his information, we are faced with 'he 
obstacle of the language barrier. In the context of the 
international nature of scientific informa'tion turnover, 
the language barrier is a big handicap to the effective 
communication in science and technology. 

The problem of language barrier should be explored 

under three angles: . • j i- • 

(a) inter-language barriers (symbolic and linguistic 

^'^^(bMntra-language barriers (terminological diffe- 
rences- of the particular subject fields); and (c) semantic 
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St c domains of social activities, 'tJiis politicolcu] u,- 1 
c .in l eclinical informalion communication The S 

thm /'.^^^'^'dwide purport are practically forced if 
t ey. want international recognition for themselves 'an 
U eir nation, to publisii and.communicate in Tome lanmn 
ge of world importance and to use the intern-, innll^ 

bardcHrscienc"'"'" ""^^ ^"™<'.""« language 
I Accepting one of tfe major languages of the world 
as the universal internailiiamal language of sc encp Th 

0 her big and small nalions, even though the practice is 
hat without the knowledge -at least passive- of Ene 
isl Russian French and German a scientist, part culaf- 
ly in a small coun ry, cannot attain scientific recogniti- 
on The majority of special terminotogies make I'sf o 
Latin roots which to some extent simplifiers ti^e comma 
nicafions between scientists of different nationalities A 
Se clVf""'"' " T.^''''"'- These circumstances may' be 

1 ven hv " ■ preference that has been 
given by scientists to the English language in the 
past two decades, apart from the dwindlingTof the inter 
national prestige of France and Germany^ as world 

2. Creation of an artificial international language of 

science, a sort of Esperanto for scientific commimicafion 



on a world scale, the er^perience with Esperanto and olhcf 
artificial languages sl;Vi'i)^v s this to be a difficult palli as 
,i't requires that every :^cientist devote a lot of his time 
lo learning a new foreign language, B.esides, it is diffi- 
cult in itself to build a universal artificial language for 
the science as a whole. 

3. Gradual development of formalized languages for 

the individual' fields of science. Examples -are chemistry, 
physics and other disciplines possessing an articulated 
systema'tic ogranization of concepts. 

4. Creation of international retrieval languages using 
generally accepted systems of symbols for the individual 
fields of science and technology. This paith is made prac- 
tical in case there are information agencies functioning 
as 'the intermediary Oif scientific conTmunication. As re- 
gards the immediate intercourse of the communicants, a 
commonly known national language will have to play an 
important role for a lom^g time to cornc. 

5. Translation of documents from one natural langua- 
ge to another on user's request. This is the satisfactory 
least solution as it considerably retards the information 
turnover and is fraught with the danger of unvoluntary 
or iirteinded distortion of 'the knowledge translated. Un- 
fortunately, this way is indispensable in case the user do- 
es not read tiie foreign language in question. To reduce 
the disadvantages, some -major countries conclude agre- 
ements for reciprocal 'translation of whole series of their 
scientific journals. For example, about 80 periodical titles 
in the USSR and the USA the being translated cover- 
to-cover according to the cultural agreement between 
the two nations. For economic reasons this solution is 
unfeasible for the smaller countries. 

Linguistics, including the ma'thematical linguistics, is 
trying to overcome the language barrier via machine 
translation. The problems involved in the algorithmic 
descriiption of the translation process, particularly as con- 
cerns seman'tic and syntactical analysis of the source and 
target languages, will for a long time ahead doom ma- 
.cliinc translation to the status of laboratory and semi- 
operating experiments, even though the total number of 
research reports and various publications on the subject 
amounts to several thousand. 
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« Invisible Colleges» and their Role Im Science 



It has been mentioned iUni the iTdprctauJ f-oinmal and 
informal comimtrnkvations between sciiiirr lien's atie a major 
stimulus promozi^'riig research activilie^. Tl-h commimica- 
feons can. .take pi^ice in the interpcr^so^ial planr. according 
to Schramnii's sciieme [4], or in I'lte group-based plane, 
or else, on :a- liTLiLted scale, with the aid of the mass me- 
dia. In tile former case the communication process comp- 
rises the followmg components: ^the speaker's idea — the 
semantic content gT the message for the speaker — the 
semantic content of the message ~ the sema:ntic content 
for the receptor — and the effect produced on the recep- 
tor. In the case ol group communication 'one has !to take 
inlo account the motivation pattern, flue intenial' horizon- 
•ta! and vertical breakdowns as determiined l)y the roles, 
norms, statutes ansdl hierarchical relations iin: 'the group 
and its universe:. IThe differences in the s.{iiiili!n±al hierarchy 
correspond to the different roles in communication. 
The senior and mranagerial staff usually jmit m&re initia- 
tive in the communication process than io the junior 
staff. 

The above factors lead 'to the emergence in science of 
the so-called «invisible colleges». The leading and re- 
nowned scientists form, on a voluntary basis, national 
and international groups without any official or- 
ganization but wi'th the main objective of information 
exchange through informal channels, in particular 
through personal contacts and correspondence, partici- 
pation in mternational congresses, conferences, symposia 
probation service, exchange of preprints (copies of manu- 
scripts accepted for print), and reprin'ls (offprints of 
published papers), and so on. These interpersonal or 
sometimes, mediated contacts between the members of 
the invisible colleges elevate these informal groups exis- 
ting in almost every field of science to the level of mightv 
mformational ?nd brain trusts that constitute a mo«t 
efficient informal means of communication. The first to 
sjtudy the- behaviour of such specific social groups were 
Price and Beaver [5]. The object of their study was the 
so-called <<information exchange group». There exist in 
the United States such groups of scientists of different 
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speciailics that have unclcr'taken as members of :an inv • 
sible collegv to send in to the group's secretariat a gujv,, 
of each paper ready for press. The secretariat multipife- 
these manuscripts and disseminates them as prepriiiiti- 
among all -the members of the group. This simple sysiliem 
has practically delivered the larger groups, uniting up tc- 
r>00 leading scientists, of the necessity to publish an rffi™ 
cial journal, for the members of the group received aT"; 
information on the major studies in their subject fekil 
severaii moiiths ahead of official publication. Naturallik; 
other forms and channels of communication were aifev 
lormed between the members of the group, as describtcu 
a1)ove, and even direct scientific and technical cooperanit- 

on. Price and Beaver must be given credit for their qi^ 

aniti'tative analysis of the social behaviour of the exchan- 
ge group in the measurable indicators of the publishijia- 
activities of the individual group members; besides tfe^v 
examined the related growth trends in joint authorslrai. 
Judging by the number of publications, 'that Price assfc 
mes to be a measure of a scientist's productivity all mmx- 
bers of the group were ranked in the diminishing ordei 
of their publication amounts. Besides, correlation betweeirj. 
the productivity and participation in the group was me-ar- 
sured, and this dependence was estabhshed to be direcii 
and symmetric. The previously known tendency towards 
the growth of joint authorship was confirmed. A valuable 
contribution of the research was -the extension of the ana- 
lysis to embrace the relations of the individual scientists 
to the different groups and the interrelations between ^the 
groups. It would be of great use for theoretical research 
in informatics in Czechoslovakia, if these methods were 
appli:ed there. 

Price, like many other informa'tion scientists, resorls 
to citation analysis to study group communications. Dom 
Swanson [6] makes use of citation analysis even to iden- 
tify the membership of invisible colleges. 

We should like to conclude this brief outline of the sco- 
pe of subjects connected with the information exchange 
through invisible colleges, this most efficient channel, by 
mentioning a major shortcoming of this form. Regrettab- 
ly, to become a member of such a group is possible only 
for eminent scientists possessing a social status thai 
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enables them to keep sufficienily close cont^icts with the 
other members of the group, 'to t.'ike part in conferences, 
to ravel, etc The choice of the members for an invisible 
college IS quite conservative. A new member is usually 
required o have'high scientific qualifications and renown 
confirmed by a multitude of publications in world press' 
and, in case of a smaller nation, 'the command of the ma- 
in languages, as no interpreters are tolerated to mediate 
the persona communications between the members of the 
invisible colleges. This problem of the language barrier 
IS of prime importance for the scientists of smaller na- 
tions They need the knowledge of world languages not 
only for personal contacts, but also for publishing their 
results in the world national and international journals 
since, It is only the separate offprints of special world 
press that grant the entrance to the invisible college 
Another opportunity for the scientists of the smaller but 
developed nations to enter the world scene is given by 
the current-awareness periodicals of the «Current Con- 
tents» type, that give translations of the contents pa- 
ps of the national scientific journals into a «world» 
language accompanied by tile authors' addresses — which 
cont'icts establishing persona! 

All junior scientists naturally aspire as soon as dos- 
sible to becomemembers of an invisible college of their 

leld. The knowledge of world languages, over and above 
me special'ty, is a basic premise to a scientific career in a 
small country. The capability of overcoming the language 
barrier on ones own is typical for the field of basic rese- 
arch. On the contrary, in applied research and in the pro- 
duction sphere this capability is lower. This faces the in- 

orma'tion officer with .the major task of bridging the 
language gap between the foreign sources and the user 
Hence the usually higher social status of the information 
officer m industry than that of the informa'tion officer .in 
basic research. 

The mass communication media, including the grap- 
hic ones, are typical of the formal (planned) communi- 
cation, and we shall not dwell on them here. 

The following table compares 'the three modes of com- 
munication. 
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Comparison of Characlerisf ics of Inlcrpmonnl and G™„p 
tommunications and Mass Coinmunicalions 



^nterperso^^^l 
communicntion 



Group cornniuniciid'on 



Author njid 
ceptor «re hi- 
vidiujls 



coinnuinic.ilion 



Aiitlior nnd recep- 
tor ;3re either irulivi. 
uiinls or groups 



.1 he source of conmuinicntion 
Jill nuiivifltinl or mi orL^n- 
ni>:^)tion, the receptor is the 
coiumimity. Stochastic nn- 
uire of the conununication 



'SIS '^f^^-sr;';.! ^ouv- s„.a, 

^"^'i'-" roles, norms. 



Jctal status) 



■stnictiircs witli the meJi;.- 
iion of individucils 



^AuUior and receptor .re in direct 



Author and receptor usual- 
In space nnd/or 



'i Author addre- 
'CS a particu- 
3r person 



Author addresses a 
piirticular person or 
members of a group 



Author .'uldrcsscs a set of 
receptors (the set may be 
empty) 



No direct (immediate) ev- 
chfinge of rotes takes place. 
Indirect exchange of rotes 
involves a time shift 



Physical presence of receirior in 
he act of communication islcessl- 



Physical presence of re- 
t^epfor not needed 



I ^^ecepfor has no free choice of the 
;on^unication subject in Us fil^? 



Receptor has opportunity 
of choice of the cimmunica- 
t>on subject (negation of 
communication) 



; It is possible to comprehend- the 
ntire conununication process 



rd A ''"possible to comp- 
rehend the entire commun - 
cation proce:;s 
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Conclusion 



The objective of this paper was to emphasize the fac 
that the informal channels of information exchange i: 
science cannot be replaced even by a best organized for 
mal communication media in the overall com-plex systen 
of the communications model. Informal — specifically 
interpersonal — communication contacts between scien 
tists will continue, even at a stage of high automa'tion o 
scientific information processes. 

The need for finding an optimal interaction bctweei 
formal and informal contacts should not be altogethe 
onmiitted from any scientific information project, since 1 
is only optimization of this interaction that can brin^ 
about a high efficiency of such a system. 
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SOME SOCIOLOGICAL ASPEKTS OF FORMAL 
SYSTEMS FOR THE COMMUNICATION OF 
KNOWLEDGE 



The origins of formal systems for tlie transfer or con ■ 
munication of l^nowledge are lost in the mists of time bu 
h V were certainly in the liands of rulers usually l(mgs 
or Jriesls. Discoveries such as those of t e Sumerian 
and Egyptians, connected with seasonal ebljs and Hoods 
of th rreat rivers, so important for agriculture, were han- 
ded ^ ^ by oral tradition, and, if recorded, were so m great 
secru', The growth of market economies, and the deve- 
lopment of international trade, led to an increase m the 
need for records, of commercial transactions; but the no- 
tion that specialist knowledge was the prerogative of lie 
elite lasted long into the history of civilisation, a ^ the in- 
vention of the papyrus roll or even of the codex form of 
b ok "n his life of Alexander, Plutarch records a letter 
that he wrote to his tutor while he was in Asia; «Alexan- 
r t -Aristotle. Greeting. You have no done wel o pub- 
lish your books of oral doctrine; for what is there no.v 
that we excel others in, if these things which we have be- 
en particularly instructed in be laid open to all? For my 
Dart assure you, \ had rather excel others m the know- 
edge o what is excellent, rather than in the extent of any 
: power nd dominion. Farewell. Tlje fre-Socratic ph. o^ 
■ soohers who made so many impo^~nt iscoveries, and 
: P ?o himself, whose influence on *ae I s orV « ideas can 
' hardiv be exaggerated, evidently pa.u h tie attention to 
^ ic'^ation; but Aristotle clearly recogmsed the value . 

: the Aook. for recording a 'yste"i<^tl«],^«f f , "1? ^ 
i iled argument that requires to be studied at length, with 



Ilie possibilify of conlinuoiis refcrci.cc back and lorlli frn, 
one par lo anolhcr. 11 is forlunalc ll.al , ^ 'f r 
m 1.1".'^° ^"^™-R°nian civili.s.lion M no „, 

riie iiKcniion of printing from movaible Ivpc wliir 
J ■ J ' " tremendous impeliis lo the snread n 
dubtry ,is llie base on which scholarship could bo founded 

minus in llaly, were scholars as well as craflsmen -iiirf nf 
en conibmed in themselves all the arts o edTo"' int 
publisher and bookseller. Groups of scholar w "the 
ormal such, as the Wandering Scholars, or fo^ta I co 

B srn n ii'^'a^T'''' "^^ l^^^thr^n Sincerity ' 
Bdsra or the Academia Secretorum Naturae in Italy hai 
no played any outstanding part in publishing but ha 

ad n^h'^''.''' "V^'' r.n instruJenl s r 

admg tbeir ideas. This aspect of bibliographical histon 
has been well documented by Thornton and Tully I 

nninicatbn T ri'lf ""'j" medium' of com- 

r s ,)'ill . 'VT^^ ^''"''"y '"iP°r'»"' medi- 
um as ,1 does, but of a less formal character. This im- 

portance was recognised, however, and volumes of cor es. 

on ence have akv^ys provided vi^tal evidence o a M s . 

ac er of the letter, however, differed from that of the book- 
V s not, generally, used as a medium of expressing a 
nil i-equence of ideas which amounted lo a complete oic 

ion ' " "'her, the means fof commu- 

nic ting quickly and easily a single item of informal' 
0 one who was cer ain to be interested, and thus Twas 
s a form the direct ancestor of the modern periodical. It 
was the Scientific Revolution of the 16lh and 17tii centu- 

scholar'i wk"^^' f°gff.the formal group of 'like-miiided 
scnolars who converted into an institutional form what 
had pre^Mously existed as a rather haphazard enterprise 
between two or three .friends. ^'Herprist 
The particular social need to be met was that of trans- 
erring highly specific knowledge so th.; it- could be put 
■to use; the motive pover came from the fact that these: 
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scholars were scientists, engaged in practical experimen- 
a work Tliey realised all loo deary how much they stood 

'g i V'xchange ot knowledge. A'lho«g ey ; 
turally anxious to have their name nued ^."^ ,8 '''' ' 
coveries they also understood, in the words ol the man 
r oae of the chief 'ounders of the moveme.^^^^^^ 
cis Bacon, that every man is a del, or to h s P^Q 
Furthermore, these men were gifted ama eurs, n he s n 
se that they were inspired more hy love o! knowledge than 

. I'o e of^ain, and they were intereste ^ » 
knowledge as well as the particular field that they wtic 

themselves cultivating. ;„ „„orMi nnd 

As is- well known, the various trends in .r'^s^i ^ ^™ 
communication culminated in the 1°, 
nrPoval Society model and forerunner of many others 
s membersh 1 eluded artists and men of letters as we 
, c enlfsts and they were encouraged to write al ont 
1 eir wo no in high-flown esoteric language, but m 

7nd\simU Enghsh that corresponded o naUir 
Speech. The same ^-"""J-^ °, / V [ ^onlv, 

h s deep 1 owledge of the literary Slavonic language and 
[ron,?happy fusion of the latter with the language of 

'"^TRryafsft was not the first periodical publi- 
,liPr even in England. This had begun before the Ci 1 1 
W '• h ield of political news and eommen b. I 
Rnv^l Society's «Philosophical Transactions* was t. .ust 
p lirperUca, to have a c— 

Vt^S^n ^lt:"w.L^ cept^ the .Pl.n^ 

Transactions., were destroyed, it is ^^1°'^^^^.^ 
■ foundations of physical science would remain unshaken^ 
and that the vast intellectual progress of the ast two 
enturi would be largely, though inco-^P'^te'V; X 
ded " Even in the nineteenth century, however, the «Phi- 
loso hica Transactions,, had, inevitably, to .coneen rate 
f t Industrial I^evolution, on the specal tie d 
l e nhvsical sciences; the same vast progress had brought 
wUl i? a ge mb r of other such societies representing 
no t onW resear h scientists and scholars, but also pract ■ 
oners in the professional fields, law, medicine, enginee- 



iccn buhjec ed to now and intensive scrutiny. In particulnr 
\hl u°u 1 «^^^°""ii""ication processhas been tud in 
ti e ligiit of General Systems Theory, and the fechnianp 

process that is to Ijenifit from the use of comouters 

;£ s r*' *^^ 
sWhSL f.r£^if - ^^^^^^^^^^^ 



imagine (hat , i i f, ' "' V^is it is uMsU,k, to 
Nature, and tl „ sf I ,f /^'f i" 
example, much o thi n„ frequently made. For 

rary and document t,on svlfT'"-^,f^''-^''^ lil'- 
'y repetitive b s ° f'"'^"^ friie, or gross- 

sys(ems as part of a 1 rg l.,^' '° ^^^ard these 

^^rds them 'as ends in iSselves 

niati n^rise^tStrtn -'o- 
•he purpose "of'r iS irseTvicr;' ™fo^'"^ 

g t;f3:{:7nf:;;ti^ °^ 

aitiude s em to m" ro?i? '° P™P'= This 

T. KitagawMo th F b irv qvn'M .^"nrihufion of 

is sorefy sometl'ng'^ ' " ed' ,ie :t\'e° " 
pose OF continuous educa 0 for iMnn n f ■ I',"^' 
liim, «to work with mrl n , m society, to quali v 

in the cybern t c l er . 1° '^^'^'^^s and systems 

Commit eron Sd nTi i; nrf t''?' ^^1'°^' of "'^ 

tl^e U. S. Mationar A ^dem Ir c"'eti°"^:t"'"*;T °^ 
demy of Engineering spea! JoHhe S foT 

ianS': The"rd;:^^r,^^ 

to buy U^^^^ t r n^eVrer 

.r,o trhVinf^S-"Lf ?i^[: rr 

demned, andX co rl Li ^ ^ T'' ^'^"''s wn- 
expend effort to oMe vet'l"''™ '° P^""^"! i^ not to 
are alreadyl'erbu dened but °" 
cess as a unified system n itsel to '"^t 

lific Information ConWncel'Jsls^'fli'^Sed f=no[ 



the cessation, at least a drastic reduction of periodical 
publication, and the deposit instead of individual papers 
at a central distributing agency. He withdrew this propo- 
sal in the face of fierce opposition from journal editors, 
but a significant reminder was given at the 1955 \UPAC 
Conference on the Documentation of Pure and Applied 
Chemistry (which has been losl lo sight because it. was 
never published as a single volume) : 

«J. D. Bernal ... already in 1948 ... proposed that 
papers should be submitted to a central agency who sho- 
uld classify and disseminate thern to those who asked for 
papers on individual topics. 

Bcrnal's proposal was certainly premature, but it is 
not too early now to begin thinking of the position as it 
will be, say, in 10—20 years hence. When that time comes 
journals will be tMn'ce their present size, relatively few 
Fellows U'ill pay for them, and industry and the State will 
consider whether their subsidies which have become bur- 
densome are really necessary; whether some better means 
of recording and distributing scientific results cannot be 
found». 

This prophetic comment was made by R. S. Cahn, then 
editor of the «Journal of the Chemical Society», who had 
as good reason as anyone to see the approaching crisis. A 
further, much more detailed, memorandum was submitted 
in 196! to the FID by I. Polzovics, who has repeated his 
warning in the VINITI volume, «International Forum on 
Informatics» [4], complaining that meanwhile nothing has 
been done: «Bccause of its significance, it seems reasonab- 
le to ring an alarm that wakes up this Sleeping Beauty 
from its slumber». Many other studies in these two volu- 
mes bear on the problem, from all aspects; that of 
M. J. Menou [5], in particular, emphasises the necessity 
for an overall study from the point of view of the indivi- 
dual scholar, who is both producer and user,' and therefo- 
re marks both the beginning and the end of the knowled- 
ge transfer chain, or system [6]. 

In studying this system in detail, then, we -have to 
start with the individual. His purpose, put simply, is to 
increase his control over his own environment so that he 
can manipulate it for his own ends. This requires under- i 
standing of how the environment vx^orks; as we said, it is 
not the creation of his mind, but exists independently of 
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liis mind. Conlro! has lo be acquired; it is not iniKiie. Un- 
derstanding, as JeiHi Piaget [7] has ?ho\vn in liis long se- 
ries of experiments with children, comes primarily from ac- 
tivity, from interfering with the environment, from expe- 
rimentation. Tliis is how the scientist also works. But on- 
ce the mind has developed, through the processv^s of scria- 
tion and classification, to the point of being able to work 
with abstractions (symbols) which stand instead of real 
phenomena, we can learn from records and benefit from 
other people's reco rded knowledge just as productively as 
if we l;ad discovered it for ourselves. Since the extent of 
these records is now so vast, the research worker clearly 
needs some agency to scan the general output and direct 
towards him those items that deal with the field in which 
he is working. When Francis Bacon took all knowledge for 
his province, he was presumably able to perform this func- 
tion for himself, but the need for intermediaries became 
felt at least as early as the late seventeenth century, when 
the «Weekly M,emorials for the Ingeniuous» were first pub- 
lished in 1682, and the Royal Society reported notes abo- 
ut the work of the Ingenious in all parts of the world. 

We are now speaking, in fact, of the individual who 
wishes to be in communication with others in discussing a 
special field. If he knew all of them, presumably correspon- 
dence would be a direct, if rather tiresome channel. To this 
day, we use the channel for our intimate friends or for 
those far away whom we wish to interest in our work. But 
the attempt, two or three yers ago, to institutionalise this 
informal relation by Exchange Groups came to nothing, 
mainly because of the hostility of professional communica- 
tors such as editors and secretaries of learned societies. 
These are the organisers of knowledge transfer, the for- 
mer by their journals, the latter by their meetings, confe- 
rences, symposia and so on. They are necessary, of cour- 
se, because the scholar wants his work known not only Id 
his friends, but also to the host of unknown «colleagues», 
to whom he can be linked by means of their common 
bonds — membership of the same societies, subscription to 
the same journals. 

This is also a two-way relation, since the scholar as 
■user, as receiver of knowledge, also profits by the net- 
work, and learns about interesting work being done by 
•people unknown to him. This aspect is particularly impor- 
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tant for younger scholars (who will be the leaders of to- 
morrow); for although it may still be true that ^v^embers 
of the «invisible college», as D. J. de Solla Pric. Jias cal- 
ed the inner, senior membership of speciaUst groups, 
know of each others' work without the need for an infor- 
mation network, this does not apply to those who arc not 
>'ct members, those who do not sit on the impotant com- 
uiittees and thus meet regularly, with time to talk to each 
other. The workings of the leading group often extend, in 
fact, to the regular, formal channels also. Diana Crane's 
survey of sociological articles shows that the academic 
characteristics of the contributors and editors of three 
«prestige» journals v/ere very similar; the inference is 
that editors and referees look for qualities similar to their 
own in articles submitted for their consideration [8]. This 
applied even when the articles were anonymous, so that 
this similarity does not depend on the editors' personal 
knowledge of the authors. 

The question of assessment is crucial. When an author 
decides to make his work public, he desires to submit it to 
informed scrutiny by experts in his field. The very fact of 
publication should be a guarantee that the work has rea- 
ched a certain standard — not only in respect of contents, 
but also for presentation, due citation of previous work, 
and so on. There is a serious danger of breakdown here. 
The remarkable proliferation of new journals could be ta- 
ken as a sign of healthy growth in the world, of learning, 
but unfortunately it can also be due to over-commerciali- 
sation of the product, «information». The symptons are: 
lowering of standards, (usually due to absence of referee- 
ing), duplication (the same article published in more than 
one place), excessive publishing of «news» (either of little 
interest, or dupHcated), proliferation of commonplace «No- 
tes» or «Letters to the Editor», and, above all, by misuse 
of the practice of commissioning article from well-known 
•authors who very often have nothing new to say, or 
no new synthesis to attempt. The question is, should pub- 
lication standards be set by market place, or by the ob- 
jective assessment of informed specialists? There are dan- 
gers and advantages in both; it has been sugj^ested by 
R. D. Whitley {9] that there is evidence that editors tend 
to fayour articles that support the exising paradigm (in 
T. S. Kuhn's sense [10]), and are wary of publishing those 
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that criticise. This cofild of course incaji that the bias is 
iigainst (he gcniiiiiely novel, unci one ciiii indeed find ins- 
tances to justify this. On the other hand, a proposal to 
depart from the paiadigni should have to carry conviction; 
it must be more than usually well supported with evi- 
dence. 

Reliance on the market r.ow carries the stamp of 
SATCOM approval, because the survival of «for-profit» 
organisations depends on «their ability to recognize, un- 
derstand and adequately serve users, and because of the 
management and marketing capabilities they have deve- 
loped. . .»; but, as with ail commodity production, there is 
nowadays an obvious tendency to rely more on the marks- 
ting capability thaii on the satisfaction of a real need, or 
the quality of the product. The philosophy of «built-in 'ob- 
soIescence» has entered the communications industry, and 
we read of lengthy surveys and complex calculations of 
the «half-life» of specialist journals. So long as libraries 
(in advanced countries at least) are well-endowed, jour- 
nal publishers are assured of the minimum market neces- 
sary to make a profit, and the essential element of risk, 
if the SATCOM assessment is to be justified, is absent. 
Standards inevitably suffer, because factors ensuring ease 
of production take precedence over factors ensuring qua- 
lity of product. There is therefore much to be said for 
Poizovics' view that all specialist journal publishing sho- 
uld be irx the hands of learned and professional societies. 

First publication of an author's work, whether as a 
book, pamphlet, periodical or report, introduces him to the 
social group whose connecting bond is formed by their as- 
sociation with work in a specialised field and their real 
need to keep informed of the work of others, known or 
unknown. The editor's or publisher's role is that of spon- 
sor; he performs the introduction, implicitly guaranteeing 
that the work is both respectable and interesting. At the 
same time, he should be objective in his assessments, and 
equally guarantee to the author his right to publish his 
work — perhaps the mobt important argument in favour 
of control by the peer-group, the learned society. This is 
not the final judgement; that is reached by the decision of 
the «general public», if one may use that term of a speci- 
al field. The work achieves, or does not achieve, consensi- 



hilily, in Zimnn's phr;isc; if it does, it becomes «puh]ic 
knowlcdgo) [11]. 

At this point there enters into the network a new ele- 
ment: the librarian, documentalist, information scientist, 
or whatever name may be attached to those who have the 
social responsibility, not of printing and publishing origh 
nal work, but of organising and putting to use the result^; 
of the publication process. «Publication» is used here in its 
broadest sense: that once the report of tlie work has liecn 
«uttered», or given out, , it comes into the public domain 
and has to submit to an organising process so that its 
true purpose — that of communicating knowledge — can be 
realised. These «Messengers of Light», as Francis Bacon 
called them, bought to his New y\tlantis, «the books, and 
abstracts, and p;itlerns of e.xperiments oi all other paits», 
and today they piay an essential part in the communica- 
tion network, even though their value may still, in some 
quarters, be underestimated. 

It is not necessary to rehearse here either the func- 
tions of information workers, or the structure of the orga- 
nisations which they serve; Ibis has been done many ti- 
mes. «The Internationa! Forum on Informatics» contains 
a number of case studies, and A. Merta has applied sys- 
tems analysis to the international level of operation [12]. 
The social role of all the members of this group is the sa- 
me—to acquire, :i':i:.ange and disseminate^ the knowledge 
contained in records that have come into the public do- 
main. In this process, secondary records need to be cre- 
ated: abstracts, indexes, reviews. The bibliographical ap- 
paratus of a subject acts as a map of the 'territory; 
it is not the territory itself, but consists of a series of po- 
ints of reference, which a user may study even though he 
is. not on the ground itself. We may plot a route on a 
map. in our own homes, and we may compile a list of re- 
ferences in a library, rather than in the laboratory. These 
references bring a user into contact with an author, even 
though he may not be directly linked to that author by 
any of the formal linkage systems already described. 

The crucial importance of this function becomes clear 
if we remember that, unless an author's work becomes en- 
meshed into this secondary recording apparatus/lt will 
be lost from sight as soon as the immediate circle of his 
acquaintance cease contact with the field of study. Commu- 
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V;;r. \. otlicr orkcrs who arc unknown to an nwihm 
w.iiLnlKT ^eparnleci by spiico or time, is impossible withoul 
the aid of bibliography in one form or another. 

How, then, should this type of «secondary recording sy- 
slem:-> develop? Technically, there is only one way opvii; 
we must make the best possible use o'' macliiines desip',^:*'/^^ 
to handle large qii.iifii(UW> -o^ data, hm itib^ U'^'^^t be m 
light o\ users' ^upi!^K1m'nfVs^ not the oonvenieitce as cmi'- 
[jViiirf ;ivv;nt;Litaetiirers. The futility of much so-called rese- 
arch i'ii this area is demonstrated by the fact that we still 
do not know how to use the data produced by one of the 
most imaginative schemes yet designed to enlist the aid 
of computers — tile MARC Project. Other schemes which 
have cost a great deal of money to develop and a great 
deal of publicity to sustain, often fall some way short of 
their ostensible target. Much more thought needs to. be 
given to the psychological, linguistic and socixv'.iHii^ieal fo- 
undations of Informatics, before we kiin^^'^ ove' ij'ie st/ceci- 
fication to Ihp mg^^mns. 

There 'mpdiii signs that, on the sociologies. I le- 
vel, the requisite action has begun. The success of YINITI 
has encouraged other countries in Eastern Europe .10 es- 
tablish central offices for scientific and technical tnfoTnia- 
tion, some of which act as initiators of bibliograpSiiml sy- 
stems, some as co-ordinators. The U. S. National ALuidemy 
of Sciences and National Science Foundation Ivai-e for 
long urged that a proper study of organisational prob- 
lems be made, and their efforts seem to be beafmg fruit 
through SATCOM — though it is not without sigmJacance 
that the membership of this Committee is heavily wsighted 
in favour of the publishers and engineers: there are few 
scientists, no social scientists, no librarians, one Dean of 
a library school. They do, however, recognise the need for 
co-operation and co-ordination as well as competition, and 
the case for special grants for new services in libraries. 
One of the encouraging features of the Report is its view 
of the field as a whole, seeing documentation as an integ- 
ral part of the wider area of scientific research and com- 
munication. 

In practice, international co-operation is already under 
way in the fields of medicine, -physics and chemistry, whe- 
re learned bodies have taken the lead. They have already 
shown the advantage of this procedure, in their sensiti- 
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viiy to criticism and willingness to modify their schemes 
to meet the informed criticism of users, l^r future plan- 

KM?c?cJ"^*^'*"^^'''"''^ ^^^^ ICSU/bnesco project 

U:\1SIST represents an important attempt to find a cent- 
ral point, or hub, from which an integrated systems linka- 
MvlcM^'r'^^ '"^"^^St^^i- The organising committee of 
Ui\lblSr comprised representatives of all, ornearly all as- 
pects of the process of communicating knowledge. Organi- 
sationally, its range is excellent; but it seems to have "been 
forgotten, for the moment at least, that just as no man is 
an island, so no subject exists in isolation either. Where 
does one draw the line between «scicnce», and «social 
science», and v:liuman5ties»? Once again, we have to re- 
inember that, although distinctions can be made concep- 
tually, these cannot be maintained successfully and fruit- 
fully when it comes to the acid test of practice. Th*^re are 
many problems of the greatest difficulty facing the sche- 
me, however, and if the necessary syst'-m of controls, chan- 
nels, of information, facilities for initiating action where 
required, is to be constructed, it may perhaps be as well 
to start with ^m achievable target in view— always provi- 
ded that the overall objective is kept in the longerterm 
prospect. 

hop this is nothing less than entry into a new phase 
in the history of the communication of knowledge; when 
all the lessons of the past have been learned and 'under- 
stood, and the aims and aspirations of those who produce 
and those who use knowledge have been taken into acco- 
unt; when modern technological achievements have been 
mastered and brought into the service of Man, instead of 
•J;'ie other way about; when librarians and information of- 
ficers see their role as actively facilitating the transfer 
of knowledge, as well as guarding the records; then we 
shall have designed and buiit an organisation to co-ordina- 
te social effort for the achievement of a socially valuable 
purpose. 
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THE AGING OF SCIENTIFIC LITERATURE 

1. Scientific Literature as a Continuous Renewal 

In rocen't years emphasis has been laid on the expo- 
nential growth of scientific literature and the problems 
that follow from the so-called «information explosion». 
These problems look specially alarming to those who re- 
gard the development of science as a purely cumulative 
process, as an endless collection of new facts"^or data. But 
if such a view were valid all scientific papers would be 
of constant scientific interest whatever their date of pub- 
lication, and clearly they are not. The development of sci- 
ence is more correctly to be regarded as a process of 
con'tinuoiis renewal rather than of continuous cumulation. 
The literature of the recent past is constantly reviewed 
and updated; data already known are merged with new 
data and arc rcdescribed and reinterpreted in the terms 
of newer theories. Working scien'tists concentrate their in- 
terest on the most recent publications and their contribu- 
tions to the literature are 'to be regarded not simply as 
additions to a growing file but as corrections and refine- 
ments of papers published at some earlier period and 
which are thus superseded. If all the titera'ture of science 
aged ten years or more were accidentally destroyed the 
historical record of a great human achievement would be 
lost -but there would be little loss to science itself. 

Though practical problems which face information 
practitioners today stem ' partly from the increasing 
growth of new publications, 'they also arise from the spe- 
eding up of modes of communication and transport. Sci- 
entists attend conferences on a world-wide scale and in 
large numbers and so keep themselves informed of trends 
and developments by direct persona! contact. The tempo 
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of scientific work lias ii'!crGa&:jcl as a result and Ihc iii- 
creasing urgency of scientific activity lias led to pressure 
to ac:ce|erate the dis'lrlbution and dissemination of new- 
scientific information especially when it can be applied 
to technological processes. Computerized systems have 
aided this acceleration. In the recent past, emphasis hc.o 
been given to 'the increasing rale of grow^th of scientific 
publication partly because the evidence of this growth 
presents itself unambiguously and dramatically in the 
form of new periodicals to be processed. But the evidence 
of decay is less immediately discernible. As periodicals 
age, J^cientific interest in them gradually fades but the 
aging periodicals emit no discernible signal as they quiet- 
ly p^iss- from active scientific interest into the archives 
of the historian of science. 

Scientific information cen'tres therefore tend to carry 
a greater volume of obsolescent and obsolete material 
than is necessary. The greater the volume of obsolete ma- 
tcriiil that has to be serviced and searched, the slovver 
and jiiore costly all information services become. Sci'^'.itists 
have improved their chances of getting new information 
quicRly by changing from dependence on libraries of the 
traditional form to dependence on computer-aided in- 
formation centres wdiich, ignoring the tiierature of the 
pas't, began their operational life by processing only new- 
ly published literature.- The new computerized systems 
thus avoided current problems oi obsolescence, but only 
for 3 time. As the databank of any scientific information 
centre ages, so the scientific value of i'ts older files decli- 
nes: magnetic patterns on tape or disc are no more im- 
mune from the decline of scientific interest than are 
marks on paper. Computerized systems have not yet fa- 
ced the problems of weeding from their files the material 
that has already been superseded, but they wm'II soon ha- 
ve to- 
one way of reducing the effects of increasing growth 
is 'to iearn how to recognise and then discard any mate- 
rial thtit has become of negligible scientific interest so 
that all the material retained and serviced by the working 
library or information centre remains of significant in'te- 
rest. The material ripe for discard is disposed of only if 
a nationat or central archival repository of scientific li- 
terature can be relied on promptly to satisfy any referen- 
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00 -lo material ihal has already been discarded by tli6 
working library, otherwise the discarded ^material can 
only be removed from the mosi active ^^ea to some other 
area where it can still be accessible. But, once a «bank» 
has been estabhshed, such as the National Lending Li- 
brary for Science and Technology in Britain, 'the rationa- 
lization of obsolescent material becomes possible in all 
libraries and information centres which have access io 
it. The major problem of obsolescence is that though in- 
terest in the Merature of science continues to decline as 
the literature continues to age, theoretically at least the 
interest can never be said to reach zero. So arbitrary de- 
cisions atjout discarding material have to be made and 
a rationale is therefore needed. 

The volume of literature of active interest can be ima- 
gined as a wave-front moving through 'time (Fig. 1). 
New literature is generated continuously at an ever-in- 
creasing rate so the wave-front steepens and reaches a 
higher and higher maximum. But 'the new literature also 
begins to age, rap'idly at first and then more and more 
slowly, its trace backwards in time being theoretically 
endless. 

This present paper is concerned with techniques for 
measuring obsolescence. They have been devised 'to reduce 
the work of computation to a minimum, As the rate of 




FifT. 1 The volume of literature of active scientific interest in 

an expanding science. 

obsolescence is a function of both the literature and also 
of its users' interes'ts it is important that hbrarians know 
how to measure obsolescence rather than accept measures 
derived from other contexts and to understand the limi- 
ta'tions of any measure they use. 
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2. Analysis of a Typical Sample of Data 

In order tu provide numerical data with whicli lo il- 
lustrate the practical technique of measuring rates of oh- 
solescence, a sample of citations was 'taken from papers 
published in «Philosophical Magazine», vols. 19 and 20 

(1969). All these papers related to the same subject - 

solid-state physics. All the citations noted were to pa- 
pers already published so that references to papers «to 
be published)) or «in the press» were ignored. «Private 
communications)) even if da'ted were also ignored. The 
citations noted were mainly of papers published in pe- 
riodicals but citations to reports and monographs were 
also included. 

The citations were noted in three separa'te sets: all 
citations in (a) the January and February issues, (b) the 
May and June issues and (c) the September and October 
issues. These three sets of ci'tations are displayed as fre- 
quency distributions in columns (1), (2) and (3) of Tab- 
le 1 with the corresponding dates in years. The dates of 
column (4) are the sums of the three samples taken 
along the rows. The da'ta of column (5) are derived from 
column (4) by summing the citations cumulatively from 
the earliest dates at the bottom of the column. 

The first point to note about the three sample sets is 
that, in all three cases, the figures reach their maxima in 
1967, illustrating that there is an inescapable delay bet- 
ween the use of newly published literature and the indi- 
cation of this used by the appearance of citations to it. 
Having risen rapidly to their maxima the figures thereaf- 
'ter fall, in general, relatively slowly though with marked 
fluctuations from set to set in some years. It can also be 
seen that the years 1941—47 were unproductive of cita- 
tions as compared with ^the years immediately preceding 
that period. This low productivity is less easily accounted 
for by sampling fluctuations because, though the figures 
are all very small, they persist in all 'three sets over the 
period. Clearly this period of low productivity can be 
explained by the fact that, allowing for delay between use 
and published citation, this period coincides with the pe- 
riod of World War II. 

The total sample was split into three sets in order to 
illustrate the fluctuations that can occur when the negati- 



ve exponcnliiil (or geometric) distribution is sam;>;e(l. 
The oxpoiieiitial dislril)u(ion iias a thick short lieaci'-ind 
a long thin tail which reaches, tlieoretically, io inf'iniiy. 



Tabic 1 

A Sample of Citation Date (Phil. Mag. 1969) 



Date 


(1) 


(-') 






(5) 


1969 


— 


15 


24 


39 


1805 


19G8 


47 


8G 


71 


204 


I7GG 
1562 


19G7 


05 


87 


96 


248 


1 9GG 


76 


70 


G3 


209 


1814 


1 9G5 


'19 


51 


53 


153 


1105 


1964 


33 


50 


()7 


150 
147 


952 


1 9G:5 


54 


42 


51 


802 


19G2 


43 


29 


30 


102 


G55 


19G1 


37 


28 


20 


85 


553 


I9G0 


18 


11 


21 


50 


468 


1959 


33 


14 . 


20 


G7 


418 


1958 


14 


13 


21 


48 


351 


1957 


8 


15 


13 


36 


303 


1956 


II 


9 


18 


38 


267 
229 


1955 


14 


4 


11 


20 


1954 


7 


4 


7 


18 


200 


1953 


7 


7 


7 


21 


182 


1952 


8 


7 


10 


25 


161 


1951 


12 


4 


— 

10 


26 


136 


1950 


4 


5 


12 


21 


no 


19'19 


7 


3 


6 


1G 


89 


1948 


4 


2 


G 


12 


73 


1947 


1 


1 


1 


3 


61 


1946 




1 


1 


0 


58 
56 


1945 




2 




2 


1944 


1 






1 


54 


1943 










53 


1942 




1 




1 


53 


1941 


1 


2 




3 


52 


1940 


1 


ii 


3 


,7 


49 


1939 


1 


1 


2 


4 


42 


1938 


1 




1 


2 


38 


1937 or 


10 


11 


15 


3G 


36 


earlier 


567 


578 


GOO 

1 


I8U5 





The relatively large errors that can arise from samples 
of citation (or usage) data have not been recognised hi- 
therto in work on measuring obsolescence and ate' consi- 
dered in mc-re detail in Appendi?v II. 



11 is licl|)fui always lo splH samples in the way de- 
scribed al)ove. Tliere is no logical necessity for citations 
lo conform with the exponential or any otiier well known 
distribution. Though one would naturally expect to find 
a slow decline in numbers as the citation age, anomah'es 
may well appear. If the sample has been split it is easy 
to checi\ whether similar anomalies occur in all three sets 
or not. If the anomaly is found in only one set it is rea- 
sonable to dismiss it as a random fhicti'alion if no other 
cause is apparent. If the anomaly persists in all three 
sets, however, some causal 'explanation of., the anomaly 
must be sought so that, if necessary, allowance for it can 
be made if 'the results are to be applied in practice. For 
example one of the sets may include citations from a long 
and important historical review while the others do not; 
such an anomaly emphasises the need to ensure that the 
'total sample includes a fair proportion of the Iiistorical 
reviews that may occur in the particular literature being 
analysed. Tlie data of Table 1, for example, remind us 
that normah scientific activity was seriously reduced by 
the iast world war and that this is a factor which may 
sometimes affect the choice of date at which back runs of 
periodicals can safely be discarded. 

In Fig. .2(a) the data of column (5) of Table 1 are 
plotted on linear graph paper and in Fig. 2(b) ihey are 
plotted in 3-cycle semilogarithmic graph paper which au- 
tomatically takes logarithms of the numbers of citations 
but allows ^the time-scale to remain linear. The cumula- 
ted data of column (5) produce smoother graphs than the 
raw data of column (4). If the frequency distribution is 
exactly exponential or geometric then the graph plotted 
on semi-logarithmic should be a itraight line. It can be 
seen that the graph is approximately linear from 1967 to 
1947, 'that it levels out over the war period and then re- 
sumes its decline once more. By the time the graph has 
reached 1947 only about 3V2% of the total number of cita- 
tions remain so lha't, for all practical purposes, only tlie 
initial slope need be considered. The straight line imposed 
on the graph to fit the plotted poin'ts 'Was drawn by eye. 

The rate of obsolescence is reflected in the .slope of 
the linearity. This slope can be determined by regression 
analysis but for most practical purposes it suffices to esti- 
rnate the slope directly from 'the graph. A straight line 
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CD is drawn very carefully from the top lef -hand corne. 
of the graph-paper parallel to the straight Ime of the 
gra^ If the annual .glng factor is a, the line CD,s hen 
the graph of a' where 1 is the age in years. As t-1, 2, 
3 4 successively, 'the ordinates of the sequence of 
points correspond to t, t^ t3, t'. . . and theip'alues can e 
read as fractions or percentages on the f ^^'^o'j/'^fj'e I" 
hand side. Any convenient point is selected though the 
higher the power chosen the more precise is the value ot 
the aging factor derived from it. 

From the graph it can, for example, be seen that a = 
: =0.295, By succesive reference to square root tables wc 
then have 

a'*-0.543, 

a2 = 0.727, 
a = 0.859:=:= 0.86. 

This value of a can be verified directly from the graph 
by reading 'the value on the scale which corresponds lo 
the point on CD where t=:.l. . 

The fact; that a=0;86 implies that overall the number 
of cita'tions in any specified year is 0.86 times the num- 
ber of citations in the previous year throughout the prac- 
tical life of the periodical. 

Reference to the line CD can now save much calcula- 
tion in answering questions relating to the obsolescence 
of this particular periodical. For example, a second mea- 
sure of obsolescence that is frequently used is the lall- 
I life, h. This half-life is the age at which the usage of lie 
' periodical or set of periodicals has fallen to 50% of the 
total usage. It is therefore given by the age at which CD 
. crosses the 50% level on the graph, i. e. 4.5 years appro- 
ximately A better estimate can be obtained by noting the 
age at which the total usage has fallen to 25%, a period 
which will correspond to 2 half-lives. In the example we 
have used, we find that CD crosses the 25% level at 9.0 
years, thus confirming the first estimate. ■ 

The measures a and h are clearly related. They must 
satisfy the equation ai' = V2 from which 

hloga = — -log2 
or h = -~log2/loga. 
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. As a<l log a will also be negalivc so thai h is nosi- 
ive. In most practice,! calculations it will be foin d th^ t 
it IS more convenient ilo tise a than h 

tlnfqoor' 11"' shows, for e.xample, 

I ITf f-uT " '^"1? '^'i^ '^^^ ^J^^^" 15 years old 

sues of fh n 'r'T ^''"^ ^'"^1 tli^'t if is: 

60/ n fh were discarded after 12 years then 

Jt> /o 01 the references would be lost. 

3. The Concept of Utility 

Consider the geometric series 

S[0, -]=l+a + a2 + a3-f... to infinity (l). when a<l 
this series converges to the limit ]/(l--a 

omitted is "'^'"'"^ ''^'^'"'^ ^ ^''-^ 

S[l, <x)]=.-a' + ati' + af^=.fat+=+ ... 
= a* (I -ha-l-a^-fa^" -I- . . . 
= a'. S [0, oo\ 

and so converges to the limit a</(l— a). 

Tiuis the series obtained when we put t = 0 1 2 

T^'T'fy 1," ^ ecLetric series wilh , 

ratio a as m the series (1) 

The data of column (4) of Table 1 (apart from two 
thl V ^^''''^) appro.ximately proportional to 

uL ?^^' ?Vm '''''' ^'^^ d^^ta of CO- . 

to m. n7n • ' ' or!" ^PP'-O-^imately proportional to the 
terms of the series S[t. -j ,vith t = 0, 1, 2,3. .. successively 

lmonri^\°i^'°'!f' °f '^''^'^^^l literature produces a 
meanty when the cumulative sums are plotted as descri- 

. aoLir-'''^'^'^'' '"n ^''■•^Pl'^s that they form 
a geometric series with a ratio a. Wc have noticed that 
m using citations there is a delay of about two vears be- 
lore present usage of a paper can be relied on to appear 
rLL f\''''l subsequent paper. It is therefore not un- 
leasonable to ignore the first two points of the plotted 
graph and to base all calculations concerned with usage 
on the fi ted straight line only. Thus all calculations are 
simpiihed with no serious loss of precision. 
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Clearly S[l, co] is proporliotKil to the loUil number of 
cihilions ihat can be expected to be made to the literature 
when it has attained the age of t years. On the assump- 
tion that numbers of citations are proportional to usage, 
then we would expect the total usageof a newly published 
issue of a periodical which has an annual aging factor 
of a to be S[0, oo] which equals 1/(1— a), i. e. that total 
usage of the periodical is limited and tlic limit can be pre- 
dicted. The quantity 1/(1— a) is therefore important. It is 
called the utility factor and is denoted by u. 

Clearly, the more rapidly a periodical ages the smal- 
ler are the numerical values of both a and u, i. e. the to- 
tal usage decreases if tlic periodical ages more rapidly. As 
for solid-state physics we have estimated the annual aging 
factor to be 0.86, the utility factor is 1/(1— 0.86) - 1/0.14 = 
= 7.14. This result implies that the total usage or utility 
c! dc complete output of a periodical literature on solid- 
state physics is equivalent to only 7.14 times the utility of 
one. year's output'— a result which emphasises the ephe- 
meral nature of scientific periodical literature. It is a me- 
asure of the volume of the wave depicted in Fig. 1. 

In practice, in any particular special library or infor- 
mation centre, we are 'normally concerned with relative 
rather than with absolute measures of use. The special lib- 
rary will be acquiring a number of periodicals which all 
contribute papers to the scientific subject of interest. But 
some periodicals will carry more relevant papers than 
others and so will have a higher initial utility if w^e givt 
equal weight to all relevant papers. It is reasonable to 
assume that all the papers relevant to a given subject age 
at the same rate but as some of the periodicals carry pa- 
pers on other subjects also U cannot be assumed that all 
the periodicals age at the same uniform rate because pa- 
pers on different subjects may age at different rates. But 
as the library will be concerned only with relevant pa- 
pers, the rate of aging of the periodicals within that lib- 
rary will be determined by the aging of only the relevant 
papers they contribute. 

If the two periodicals Pi and P2 contribute Ni and N2 
relevant paper per annum respectively, then tlie residual 
utilities of Pi and Po of the t years will be 

lj[Pi, t]=kNiUat 
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and 

U[P2, t] = kN2Uat 

where k is a constant. We therefore have 

tI[P,. t]/U[P2, t]=N,/N2 

for all values of t, i. e. the residual utilities always rema- 
in in the same ratio as the unitial utilities of the perio- 
dicals. 



4. Synchronous and Diachronous Perspectives 

Sample counts of citation, usage or reference have to 
be taken over a relatively short period, otherwise the re- 
sults will be blurred. At the time they are taken such data 
provide estimates of the intensity of interest in the parti- 
cular literature, distributed over all hock issues of the pe- 
riodicals, as they stand at the time of the count. But does 
such a view of the literature as it is now enable us to pre- 
dict the obsolescence patterns of, say, 1981? Do the data 
we have been analysing, .which include data for the year 
1959 and earlier, reflect the pattern of obsolescence as it 
would have appeared in 1959? Are rates of obsolescence 
changing? Have they been affected by the operation of 
computer-aided S. D. 1. systems? Is a rate of obsolescen- 
ce affected by the rate of growth of the literature? We need 
answers to such questions in order to establish the relia- 
bility of the projections that are needed for planning in- 
formation systems rationally. 

In considering such problems it is helpful to distingu- 
ish two ways of regarding a run of annual periodical vo- 
lumes. The run can be regarded either: 

(a) Synchronously — when we consider the utility of the 
run as a totality now, or 

(b) Diachronously — when we trace the declining utility 
of some particular annual volume as it ages year by year 
from its date of publication. 

If both the anmial output of papers and their rate of 
obsolescence remained constant year by year, the diachro- 
nous tracing of the declining utility of some particular vo- 
lume through the years would be exactly mirrored by the 
synchronous viev>'.of the declining utility of the actual vo- 
lumes at present occupying the siielves. But most litera- 
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lures of science are growing rapidly and it is not known 
whether tlie annual aging rale is affected by rate of 
growth. Clearly, this is an empirical problem; the answer 
can be obtained only by making suitable measurements. 

For example, Fig. 3 shows the results of plotting a cita- 
tion count {5\9 citations) on articles (relevant to solid 
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Fig. 3 Cumulative distribution citations: Pbil. 
jviag. .1959 solid-state physics only (n = 519). 



state physics only) in the 1959 volumes of «Philosophica\ 
Magazine». The plot shows two linearities, separated by 
the war period, of distinctly different slopes. The annual 
aging factor is 0.89 for the pre-war and 0.80 for the post- 
war linearity. The figure for 1969 has already been shown 
to be 0,86 (based on a sample of citations). Are these va- 



77 



FILMED FROM BEST AVAILABLE COPY 



uos 01 a significantly different nnd, if so, am tlicv be re- 
lated to growtii or to any other measurat^lc factor^ 

Uearly, more ineasurenients of more iiteniturc are 
needed to clarify these problems relating growth and ob- 



5. Simple and Composite Literatures 

^h.Tln^''r^'^■'''^^^ solid-state physics, which provided 
he data for the illustrative examples of the previous sec- 
tions, conforms closely with the negative exponential law 
of obsolescence and yields linearities on semi-logarithmic 
graph paper as demonstrated. But not all scientific litc- 
raiures conform with this simple law. 

If a literatnrc has two or more distinguishable compo- 
nents they could be the theoretical and the observational 
aspects of a science, the theoretical and the practical as- 
pects ot an engineering subject, or the theoretical and 
vaxonomic aspects of a biological siibject — it may be fo- 
und that the two components age atf different rates. This 
effect was noted by Meadows in his study of the obsoles- 
cence of the literature of astronomy in which he found an 
«immediate» interest in theoretical astronomy, which mer- 
ges with theoretical and space physics, and a «historic» 
history ^^^^''I^^'^'^ astronomy which has a very long 

Unfortunately for the analysis of aging, even if the 
aging ol the two components each conforms precisely with 
the negative exponential law though at different rates the 
aging of the combined literatures taken as a unity does 
not conform with a simple exponential law. The effect is 
illustrated by hig. ^, m Fig. 4 (a) the two components P 
and g of equal size;, liave annual aging factors of 0.97 and 
0.81 respective y. In^Pfg. 4(b) the two components, of the 
same aging rates, 0.97 and 0.81 are in the ratio of 1 to 3 
it'^can be seen that in both cases the composite result of 
y'^^ ^^^'^ linearities does not provide a third linea- 
rity. Ihis means that this is no solution of the equatif 

where q is a fraction and a, p, y are constant aging fac 
tors, which is valid for all values of t, except for the triv^- 
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al c.'ise in whicli a = ^) = y. In general it can be seen that 
initially the more rapid rale of decay is liic predominating 
factor j)Lit that in the long run the slower rate predomi- 
nates. 

The problem of sorting out tlie components of a samp- 
le of citations from such composite literatures can be sol- 
ved graphically, if need be, by taking a sample which is 
large erough to give a reliable «tai]» of the older citations. 
The tangeiit to this tail is produced to run back to t = 0 
(l-ig. 5). The ordinates of points on this tangent are no- 




AGE (YEARS) 

Fig. 5 Graphical analysis of a literature with two components 

of different aging srates 

ted and subtracted from the ordinates of the original plot- 
ted curve using, of course, the logarithmic rather than a 
linear scale. The reduced ordinates are then replotted. If 
the original composite curve has only two components, 
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then the second plot may he linear; if so, then the two com- 
ponents will luu'c been scpariited. 

However, furtlier (lifficultics iire likely to be met. This 
'graphical :miilysis would work only if the proportions of 
the two components remained constant over the period co- 
vered by the analysis. If q is a function of time also, i. e. 
if the proportions of tlie components arc changing with 
time, as they may well be, then the above graphical ana- 
lysis will not yield clear-cut solutions. 

I-^rom a practical point of view also, any library or in- 
formation centre wliich provides an information service 
for a composite subject cannot efficiently serve both its 
«immediate» ai;d its «histcric» users with a single set of 
periodicals cut off by discard at some compromise pouil. 
The «immediate» users could justifiably complain of the 
obsolete clutter in which their own papers were embedded 
and the «historic» users could justifiably complain that 
tile compromise discard point was imfair to them, it 
would be preferable to separate the two components and 
to service them separately even though some of the cur- 
rent periodicals in the subject may be contributing to 
both components. 

Subjects of some breadth, such as «chemistry» or «phy- 
sics» or «medicine» are certainly composite, with several 
components. They are most unlikely to conform to simple 
exponential laws. This fact implies that ineasurtiments on 
obsolescence must be done on papers rather than on pe- 
riodicals and with good bibliographical control. 

In analysing the aging curves of composite subjects, 
the distinction between synchronous and diachronous per- 
spectives must also be remembered. A composite curve gi- 
ves only a syndironous perspective. It would be erroneous 
to interpret a curve as indicating that the rate of obsoles- 
cence of the subject has been changing, because such a 
curve may be the result of conflating two components which 
age at different rates. Studies of rates of change of obso- 
lescence require diachronous data, e. g. estimates made 
directly on the literature published at intervals over the 
period of interest. 

A general science library serving a number of scienti- 
fic subjects and several different immediate, historic and 
other kinds of interest will be- forced to compromise. It is 
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the easier to do so if it is backed by an efficient regional 
or central 'bank' of scientific periodicals. It becomes incre- 
asingly important to tbe development of efficient and eco- 
nomic information services, all grappling with the prob- 
lems of both growth and obsolescence, that hierarchical 
services with clearly defined ranges of literature should 
be established. 

6. Citation and Usage 

The data used in this paper were derivfd from citati- 
ons in a specified periodical and quoted by papers which 
contribute to solid-state physics. Such citations are assu- 
med, 'to indicate the actual use made of the literature by 
the scientists who contribute to it. They have the advan- 
tage of being objective in the sense that the same ci'tation 
data are observable by all who care to interest themsel- 
ves in them. 

But the exac't relationship between the actual library 
usage of tiie literature and the use of citations is not 
known. If the library is serving users who are not all 
contributors, then citations are most unlikely to be indi- 
cators of actual usage. A university library, for example, 
provides for research scieriLis'ts, students and historians 
of science and possibly for general readers also. In such 
cases the use of citations must be abandoned for records 
of actual usage within that particular library. 

It will almost certainly then be found tha't the data do 
not conform with the geometric or any other standard dis- 
tribution. If a measure of obsolescence is required, the 
mathema'tics and measures used in this paper do not app- 
ly. The only solution is to express the cumulative data as 
percentages of the total, to draw the graph of these per- 
centages (log scale) agains't time (as before) and , then 
make all estimates directly from the graph and without 
the use of any measures derived from the geometric di- 
stribution. 

7. Obsolescence and Growth 

Is the rate of obsolescence of a scien'tific literature re- 
lated to its rate of growth? This is an empirical question 
which can be answered only by measuring the rate of 
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obsolescence of ji growing literature at intervals. But, a^i 
differences arc likely to ha small, 'the ■measures nmst be 
based on ve'-y large saniples to ensure that any differen- 
ces found are statistically significant. It would therefore 
be helpful to consider the question theoretically to see 
what might be expected and to formulate hypotheses to 
be listed. 

If the rate of growth of the literature is measured in 
terms of the number of p.ipers pubtisiicd per annum, it is 
reasonable to assume initially that the productivity of 
the contrihuting scientists remains constant, or approxi- 
mately so, and that 'the growth of the number of papers 
published indicates that there is a concomitant growth 
in the number of contributors. It will also be assumed 
that, overall, 'the new contrinutors use the literature and 
in particular, distribute their citations as the contribu- 
tors whose ranks they join. This assumption implies that 
the utility of the literature increases at the same rate as 
the number of contributors. 

If g- and s are the annual £-owth factors of the num- 
ber of papers and of the number of contributory scientists 
respecii-. ely, then 



and 



N(T)-N(0)gt 

LI(T) = U(0)st (2) 

where N(T), N(0) and U(T), U(0) are the numbers of 
papers published and 'the utilities in the years T and O. 

The ulilities are also related to the correspondmg an- 
nua! agin^^ factors, ao and a-r, by the relations 



1 x 



(O) 



and 



N (T) 

U(T) = j^ (4) 



We now seek the relationship between aT and ao. Ihe 
terms (U)T and U(0) are eliminated from {^2) by substi- 
tu'ting the. expressions (3) and (4) to give 
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Tlien, from ( 1 ) 



N (O) g 



T 



N (O) s"^ 



a 



or 




which yields 



T 



a 



o 



T 



1 — a 



I-(g/s) 



o 



The expression on the right hand of (5) is positive if 
s>g', zero if s = g and negative if s<g. So ax is greater 
than, equal to or less tlian ao if s is greater than, equal 
to or less thaj g. At least 'these are the results that \vo- 
Lild be expected on the stated assinnptions. 

It can be seen therefore that any change in the value 
of the annual aging fadtor is likely to arise only if -there 
is a change in the average productivity of the contribu- 
ting scientists as measured by the nunnjers of papers 
they published per annum. 

The measure of productivity in the direct sense would 
be difficult to estimate because there is no accessible re- 
cord of contributing scientists who do not happen to pub- 
lish at least one paper in the period of measurement. So, 
by various sampling techniques, estimates need to be ma- 
de of g, s, ao and aT for som^ 'convenient value of T. 

Estimates of these quanj,i1;ies as they appJy to solid 
state physics for the years ID63 and 1958 have been made 
by Oliver [2]. The results are: g= 1.134, s= 1.150, ao==0.78, 
aT = 79. Her consideration of the errors arising from her 
sampling techniques leads her to conclude that the diffe- 
rence between the estimate of g and s is not significant 
at the 5% level. As the annual aging rates, based on ci- 
tation counts, are equal, Oliver concludes that the. above 
theoretical analysis is corroborated, i. e. that rates of" ob- 
solescence are not sensitive to rate of growth but are sen- 
sitive to changes of productivity. 
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Vlacliy [31 lias rcccnlly shown that rhaiigfs are ta- 
king place in -the degree of nnilliple authorship of scien- 
tific papers. It may tlierefore be necessary to consider 
how these changes effect measures of «productivity» and 
so a further factor is introduced into any analysis of the 
relationship between growth and obsolescence. 

8. Applications 

Tlie main applicati<^'i of measures of obsolescence lies 
in the rationalization ot the periodical stocks of special li- 
braries and in the planning of i!i formation systenis. 

If in a special library the subject of scientific interest 
happens to be 'simple', 'such as solid-state physics, then 
we can assume that scientific interest in the periodical 
stock declines as the stock ages, in the case of solid-sta- 
te physics at a rrie of 14% per annum, in a way which 
spreads this rate of decline uniformly over the complete 
runs of all the contributing periodicals. Especially if the- 
re is a central library able to supply copies of eariy pa- 
pers required, there seems ^to be little piirpose in retaining 
back numbers indefinitely since they take up shelf-space 
tnd require servicing even if they are rarely used. But 
Vv hat should be the guiding policy for the librarian who 
wishes either to remove the h'ttle-useu volumes to a back 
s'tore -or to discard them completely? 

'H-'Mierto, where discard has been practised after the 
rate of obsolescence has been measured, all periodicals 
have 'been discarded at the same age to leave equal runs 
(measured in years) on the shelves with immediate ac- 
cess. This technique i Mmple to apply but i't takes no 
account of the fact th. oome periodicals are more useful 
than others. The result s that the most highly productive 
periodicals are discarded a't an age when their residual 
utilities are still re' Mvely high whilst the least producti- 
ve periodicals are retained for longer than their useful li- 
fe. This technique is therefore still wasteful of shelf — 
space and is clearly not 'the best solution to the oblem. 

• The principle that is proposed relies on the concept of 
utility. Some acceptable minimum utility is decided upon 
and a periodical is discarded only when its residual, uti- 
lity has declined to this minimum level. The result of ap- 
plying this principle is that the runs of retained periodi- 
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cnls iirc 110 Inngor eiiiuil in gonoriil; Uic niosl useful nc- 

'iro discarded °^ "^l"^' "'"'''^^ 

If the minimum acceptable level of utililv i c iho 
'il^rdcd u UHy, is d, then the age of discard 'of any pc' 
nodical with initial utility i is given by ^ ' 

t (Jiscnrd ut ility 
^ ^ iiiilinl utility '"^fl/i (1) 

If a, d c)nd i nrc known, the corresponding value of t .-n 
1)0 read from the giuph of a'. ' ^ 

The remnining problem is to quantify d and i though 

ovoT nf ? s'"^PJf t way is to consider the minimum 

level of acceptance of periodicals. In any real libiary "^o- 
me periodicals will have been selected for acquisition aiid 
others consider.-d for possible acquisition, will have been 

subiect m'^); ^'^^'^ ^'^^^ relevant to the 

subject. It should therefore be posMble to examine the 

mos margmal of the periodicals acquired and to determi- 
Mnn m.'''^?'"*"'" ^^^-ruhcv of relevant papers, n,, it yields per 
annum. If some other periodical already acquired yrelds 
no relevant papers per annum (n2>n,), we have 

at~d/i = ni/n2 (2) 

as required. This method has been described in more de- 
tail elsewhere'*. 

nff^nlT"!' determining the required cut- 

off may he based on the cost of acquiring loans or photo- 
copies of tl e papers called for from the tails that may 
be discarded. As estimate is needed of the "total cost of 
acquiring a loan or pho^r^^.^py, including an estimate of 
the cost oi the c.ieriC:nl w.>Tk involved and the cost to ihr. 
borrower of in- dv^lay acquiring the required copy ex^ 
ternally as wci! ..f -.vage and any other direct charges. 
i.et his tot^i co-^ .V. monetary units per photocopy or 
loan. Tiie prmcip:. 1.^ >c applied is that the periodicals are 
a]:-,c.:)> ivnen th(i cost of the substitute -photocopies be- 
comes le^-. ...an the cos'ts of letaining the periodicals on 
nnmediate access. 

-^iMir^ ^^J?, (t+l)th year of the life of any periodical 
Its utility will be (I— a) or aVu. The number of de- 
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niaiuls nuulo lor [kiikm-s wiliiiri ll.iis particiilnr ;mniial vo- 
lume. will depend on Iwo factors: first, llie average niiin- 
her of relevant papers it contained initially — say n — 
and, secondly, the intensity of local usage, say b, of the 
particular pc'riodical per annum. This factor b, which will 
varv from one library to another, can best be determined 
by noting the number of demands for it during any tem- 
porary absence of the periodical such as when, for exam- 
ple, il; is senl for binding. (iBecausc demands f{)r a perio- 
dical decrease as it ages, some correction, which can be 
obtained from the graph of a', may have to bo made tc 
the value of b derived from the count of demands). The 
total number of demands made for papers in the volume 

of age (t + 'l) years will thus be-^^-^^. 

The total cost, s, of retaining the periodical on the shel- 
ves for one year, again allowing for its proper share of 
current servicing costs, is also needed. The average cost 
m a demand for one of the papers it contains will then be 



su 



-This will come at a time when tl\e quantity (2), which 
increases as t increases, becomes greater ^than p. At equ- 
ality we. have 

su 



a* rib 

or 

pnb 



and the value of t can be found by reference to the graph 
of a'. 

Clearly this second method demands a greater detailed 
knowledge of the costing of the library services than is 
normally available. 



APPEiNDlX 1. 



' The negative; exponential nnd geometric distribiit-ions. Because 

; time is a continuous vta.riable, obsolescence is a continuous process 
■ for the tlieorelical analysis of wWch the continuou-s negative expo- 
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neniia! is thc> most npproprialc c]islri!)ui:Dn. The pmh.-ibililv JcnsiJv 
fiinclion is 

P(T)-= ^c--'^^", 0<.T< (I) 

where a is a cou [ant. But .in npplyin^ the forniiilr-i lo Uie afrlnrr of 
literature, wc find that dates of citation arc iisiiallv available only 
in terms of the year of jniblicalion. The conii-; ■ ous "time variable is 
thus broken up into a sequence of discrete pciods of on-e year. We 
thus count the number of citations to papers published in particular 
years and lake the year as the unit of time. 

In a distribution of citations which conform with (I) above, the 
probability, P(t). that a particular citation is earlier or older than t 
years is given by 

oo 

?P (t)=^\4^"'^''''"MT 



If wc put c ^a, wc have 

P(l)=-ai (3) 

which forms a rreomeiric scries as 1 lakes the values 0, 1, 2, 3,.... 
in sequence. 'The probability of a citation other than t years but not 
older than (t+l) years wi:ll then be 

P(t) — (P+l) = a» — 

= a»(l--a) ' 

which is the general term of ihe basic geometric series used In this 
paper. 

The diiscrctc geometric distributions (3) and (4) are thus used 
as convenient avid close a pproxiniatio-ns to the negative exponential. 
A sinu'lar argument shows that in the anailysis of growth a geomet- 
ric series, with a factor greater than one and therefore divergent, is 
an equally convenient appro.\imation for bibliographic analysis to 
continuo'us exponential growth. 



APPENDIX ir. 

The sampling error of the geometric distribution. Measurement of 
the rate of obsolescence always invo-lvcs the sampding of citations 
and sampling always involves possible errors howevor carefully the 
sampli^ng work may be done. Any particular value of a measure of 
aging derived from a sample must be regarded not as the true value 
but merely as a single random sar ple of all the possible values that 
could be derived from other diff-erent samples of the same size taken 
independient'ly from the same source. 

The exponential or geometric distrLbutions wihioh underlie all ob- 
solescence measures give rise to relatively large sampling error? 



hich nuiv nccounl fur some of U. ^ :!p!>:!renl IneU ol '^^^^^^'^ ^'^l 
T^<^n'^ '-cportod bv (iiffc:cnt workers on the same- body oi .it 
A M \vhh^h have 'led lo snmo lUK-crlniiily in nppltcnlton. ThL•^c. 

erri;s rc^^^^ discussion wUh parlicuh.r reiorcnce to 

ic saniplinff of Ihc seomeiric distribut:nn _ n -I- - -4- -r^ -f- 

[\ Iho geometric distribution hns the iorm (1— a) (1 a f^... 

..a^+.-.to infiiTiW then the mean of this di.slribui!on 1^7—7 
ud its sUnd.ard devialio.n, which is a measure of the spread or dis- 
)e>rsion of a dislribut.'ori is 
a. So \vc write 

i — n 

mean, ^i— a/(i~a) 

s. d., cT==a/(l— a), 
r^tiivrntes of ilie mean derived from samples of size " ^^-^I'l^* 
orn, a ^-.ccond distribution which has l!ie same mean, n, as the (h^: 
ribution which is sampled but wilt have the smaller deviation nj y n 
rh?^ scco d distribi.tio^ of estimates of the mean w,ll I'''v.= ' 
crent form. If n is .larfro enou-h, as will usually be the . 
.uri K obsolescence, tlfo second distribution can be j'ssunied to in, 
lormal or gaussian. This distribution is symmetrical about its moan, 

^^ '"Thestnn lard deviation of the dcstrjbution ol estimates of the 
no^n will therefore be /(I— a) V n. U is known that 95% of 

lence 50/0 of the estimates will have en error greater than 
-J,1.96cfyn which for the geometric distribution becomes ± 1 -Do 

To'^xpJ^s"' this approximately as a percentage error we divide 
Ihe nbos'c expression by^i and mviltipyl it by 100. After reduction the 
required percentage is 

iioo/yan (n 

If wc wish the error to be less than 10% of (with a proba- 
bility ot 95%). the criticr^i sample size is given by ~^=]0, from 
which 

o« — 10 ns ~ 400, 

As a<l, M>-;00, i. c. the minimum sample size for obsolescence 
measures wi h reasonable cxpr.-ctation of yieldnjg an error of I^^s than 
0% n 400. a figure which increases to 500 iti a ,s known 0 be 0 80 
or exam^ple. Unfortunately, precision increases only a^s the square 
•oot of the 5.ample ^^ize so that to reduce the possible error to 0/0 
wniild renuire the sample size of 500 to be increased to 20t)0. 

''"There isSioe^-c.p^e from sampling errors whaV.ver obsolescenc^ 
measure may be selected ,and whatever technique may be used to 
Hprive it<: value The errors are .inhereni m the sample. 

' Even the abive discussion of errors is based on the assump ,on - 
t-hal the sample has been well ranclonuzcd. M has also become obvt- 
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Fig. 6 The error d.istribuiion of Iho mean of a geometric 
scries with a = 0.80. (n=-400). 

OMS that the lileralure whoso obsolcseciice is beinj? measured must 
be well defined hibliographicaily so lhal others who may be intcres- 
resuUs" "Measure on their own sources and gel compara.bit 

In qnotinrr the results of measurements on obsolescence it is tho- 
reforc necessary always to quote the sample .size and to specify the 
literature studies as precisely as possible. * ^ 
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J. Wankc 

(Czechoslovakia) 

CONi LICTING PHENOMENA I N I R SYSTEMS 

The art of Infornialion Relricval (IR) has reached a 
level where standard technologies and methodologies are 
being accepted. In discussing the theoretical aspects of 
this art, no distinction is genera'-y made as to the diffe- 
rent practical application branchc-s of IR Systems. In 
our discussion, we shail treat the distinctive features of two 
extreme cases of IR Systems, viz. what we may call :i 
«lrue» documentation IR System (say, a patent library), 
and a business data IR Systems (such as for inventory 
control, portfolio valuation, or flight reservation). 

Essentially, there is no difference between searching a 
documentation file or c. business data file, as both systems 
use much the same requisites and technologies. On the 
other hand, they may differ — and in most cases actually 
do so — in certain points, the importance of which has 
been somewhat neglected up to now as compared to the 
; importance attributed to other system parameters (e. g. file 
size, access rate, relevance, noi e, recall ratio, etc.). 

The por-nts considered here are: (a) file utilization, 
(b) meiT'n record ' lifetime, (c) mean record length, and, 
(d) mean item value. 

(a) First of all we shall try to define a measure of the 
utilization of the file, or of the «turnover» of the informa- 
tion stored. Let us relate the size of the file to the num- 
ber of queries requested during a given time period by the 
simple formula 

D-q/F (1). 

where D is the daily duty factor, or measure of the file's 
utilization. 
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is Iho cveragc number of queries per working day, and 
F is the size of llic file {niunbcr of records, each record 
corresponding to one item). 

In a more refined form U'hich will be discussed later, 
the measure of the :ie's utilization will involve also the 
volume of an answer averaged over a high number of 
queries. 

The (daily) duty factor D depends on the nature of 
the service. In a news agency with several million records 
in the file, about 600 simple searches (e. g. for biographi- 
cal data, photot^-raphs etc.) are made daily which corres- 
ponds to D of t, order of 10 On tiie other iiand in a 
conventiojr 'DC file in tlie Prague Central lechnical 
Library w out the same number of records, some 

2000 scare. .. u ure made in 1960, i. e. less tlian 10 sear- 
ches a day on the average. This corresponds to a D figu- 
re two orders of magnitude lower than in the previous ca- 
se. ) ■ small peek-a-boo card fWcs v/ith a few thousand re- 
cords a typical D figure mig^d; be somewhere in 'the 10"^ 
through 10"^ range. Generally speaking, in a given field 
of knowledge, D is to be expected to be invariable over lar- 
ge ranges of file sizes. 

(b) Moreover, there is also a difference in the avera- 
ge life time of a record in the file. Thus while the ave- 
rage life of a record in a flight reservation svstem will 
not surpass a few weeks, an item in a documentation re- 
cord file stays there for years. 

Therefore the price of a bit (or byte, word . . . etc.) 
stored is not determined only by the total storage price vs 
storage capacity ratio. A time factor corresponding to the 
mean life time of an item should be also considered. Jn 
the old days of paper card files, a long lived record requ- 
ired not more than some drawer space. In modern niass 
storage media it is the time the space remains ncruoipd 
that counts. 

(c) In a business data file a record may vary in length 
betvvcen a few characters and hundreds of characters 
whereas the length a typical documentation record be'' 
gms where the length range of business data records ends. 

(d) As for the item's mean value, in business files the 
cost of Svoring and processing each individual record uses 
to be discounted from the value of the respective item. 
Needless to say that a business system would scarcely cap- 
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liirc jm item of a value lower than the storing and proces- 
sing cost. On the other hand, Ihe «vahie-,> of an individu- 
al documentation record is very vague and might be de- 
termined but indirectly e. g. by averaging in from daia de- 
rived from the overall effectiveness figure of the file. 

When taking business IR and documentation IR Sys- 
tems as two extreme cases of the same problem, we can 
illustrate the foregoing parameters in a Table: 



PnratnclLT 




l,iti(Hi I'ilo 


Utilization (duty factor; 
Record length 
Recoia lifetime 
Item value 


high 
short 
short 
adequate 


low 

long 

long 
■> 



Any two or three of the above parameters may be rep- 
resented in a two- or three-dimensional diagram, respec- 
tively, e. g. 

/Record Eifetim" 




Duty factor 



Record Bength ^ ^ — -j^-^ 

The respective positions of typical documentation and 
business data files are labeled D and 3, It should be noted 
that they lie near the opposite corners of a cube formed 
by the coordinates. 
Remark: 

A more refined method of evaluating the utilization of 
an information file would include the average number ns 
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01 rcl-vant records found within a sciireh. Bv tnultipiying 
n.< by the daily dvUv iiiciov D and hv the expected mean M*^ 
Ic time t of a record we get \vhal"wc might call the pa- 
geirg factor p of the records in the file: 

p-n,.D.t (2;,), 

or 

p = n.s.q.t/F (9h) 

Figures of p met with practical systems show that 
wliije documentation files do not score more tlian about 
10^-, a business or data bank file may reach a value of 
10'' or even more, altiiougb the mean ii'fe lime of a record 
in the latter is generally short and a typical answer mav 
contain but a single record. In other words, in the busf- 
ness file types it is the high utilization rale of each record 
wnich IS responsible for the relatively high p values. Each 
record in a business file is paged many times during its 
short life wliile in a documentation file just a minority of 
information stored is called up a few times during the 
file's entire long life. 

Now lot us return to Ihe impact of the above factors 
on the system's design. Most of our documentation IR sy- 
stems are sequentially operated when searched, i. e. who- 
le files (or substantial parts of them) are scanned in each 
search r:m, independently from how many queries are 
involved. Such a system necessitates a cumulation of que- 
ries, i. e. batch processing. 

The problem does not lie so m\ich in the fact that of 
course real-time processing is inconsistent with cumula- 
ting batches, as- in the long time needed for gathering a 
batch big enough to be worth processing at all. 

The user feels attracted by lip-to-date types of servi- 
ces and appreciates their high speed and reliability as pro- 
vided by computers. On the other hand, he dislikes giving 
up facilities he got accustomed to, such as browsing pos- 
sibilities, immediate or at least short term ^.nswering of 
queries, and parallel searching. 

In a typical sequentially scanned documenttition IR 
system the cumulation period rnay attain as much as se- 
veral days for reasons of economy; thus the possibilities of 
man-machine .conversation or browsing are severely const- 
rained. Actually, the overall speed of u sequential scan- 
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ning IR syslcni willi query ciiniiilalion is lower Ihaii Ihc 
speed of most iiiiiniial systems of (lie i^anio size, 

Any correclion of a query resulting from a not fully 
satisfactory previous search- result -which is a procedu- 
re inherent to documentation IR- introduces a delay equ- 
al to the cumulation period, Thus the ultimate volume of 
any sequentially scanned file is determined not only by 
the technical parameters of the computer, such as magne- 
tic tape length and storage density, transfer rate, etc, and 
l)y the retrieval language .parameters, l)ut primarily by the 
duty factor D. 

In documentation IR systems- as opposed to business 
data files- the duty factor figures are inherently low, 
which is due to the quantitative discrepancy between the 
needs and the volume of information stored. Several case 
studies of research and/or development projects have shown 
'that the queries requested are not high in number du- 
ring the whole project period. Typically but a few large 
searches were required within several years of research, 
not counting a lot of com|)lenientary or au.xiliary searches 
which, paradoxically, aimed into ether fields (e g, a 
wiring diagram of a voltage regulator was searclied for 
in a biological research project). 
Example: 

For a modern medium size computer with fairly fast 
logic and a quite primitive indexing and retrieval langua- 
ge with no syntactic rules a scanning rate of 10^ records 
per hour may be quoted as a representative figure. 

Assuming a cumulation period of 24 hours as just 
adequate and allowing a rather high cpst of each query 
equal to one computer working hour, 'and neglecting all 
other variable factors, would lead us to a rough estimate 
of an economically justifiable duty factor D=10"^ This 
figure corresponds 'a about 2500 queries a year for a file 
of 1 mil, records, which is far more than the figures met 
with documentation IR systems. On the other hand, it is 
below the figures representative for business data files, 

Ihere is but one way out of this blind alley: some kind 
of inverted file in a random access memory At the time 
being, the only applicable storage media, are the magnetic 
disc or drum and, the magnetic card. In the near future 
e. g. photoscopic memories with, coherent light read-out 
might be the answer, For inverted file systems the duty 
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factor is not as important as for sequentially scanned sy- 
stems because no query cumulation and hence no substan- 
tial answering delay is involved. Nevertheless the duty 
factor provides a rough indicator of the utilization of the 
systejn. 

With inverted files real-time searching is feasible and 
no query cumulation is needed. Search time are much 
shorter than with sequential scanning but the economies in 
computer time are traded-off for high bit price. Therefore 
business data files with their short record length, short 
record lifetime and high item value are more suitable be- 
ing for stored on RAM devices than documentation files. 

Calculating the storage capacity requirements for a fi- 
le is simple arithmetic. For documentation, at least 300 
characters or some 3000 bits per record stored are to be 
counted with. Thus a medium size file of 100 000 records 
will require a storage capacity of about 3.10^ bits which is 
much but still within the range of RAM stores. What is 
worse — besides the relatively high bit price of such de- 
vices—is the fact, that each bit element in the store will 
be occupied once for ever. 

Therefore, big computer documentation files have to be 
waited for until the time when the price of accomodating 
information may be neglecter. This goal is being reached 
by now by the magnetic tape it is far ahead for other 
high-capacity and short access-time storage media. The 
hardware and software do not match the requirem.ents of 
documentation IR. It dawns to me that the manufacturers 
of computers have not yet begun to consider the documen- 
tation IR market interesting and extensive enough. Other- 
wise they would have provided this field with more spe- 
cialized equipment. Up to now documentation retrieval 
hardware has followed wi'th few exceptions the way trac- 
ked by accounting machines. 

The above considerations may at least partly explain 
certain observed facts in the IR field, in particular 

— why business IR files are more successful tha.i their 
documentation counterparts; 

— why business IR files are better suited for inverted sto- 
rage on RAM devices; 

— why the documentation services had to look for the way 
of making indexes as an interim solution which' would 
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have taken the place of the not fully salisfactory cent- 
ralized IR systems. 

In conclusion another inconsistency in the IR practice 
should be pointed out: While projecting an IR system, the 
designer tends to make the most of the informalion that 
is going to be put into it. In most computer oriented sy- 
stems, IR retrieval is complemented by some type of cur- 
rent awareness service such as SDI, printed subject and 
author indexes, and the like, so that f ^ ii record is disse- 
minated in one or another way before 1 -Mig stored for ret- 
rospective retrieval in a more or less d'siaiit future. It has 
been shown that the economy limit of an IR system de- 
pends to a high degree on the magnitude ■(}{ the duty fac- 
tor, i. e. — for a given file — on the number K)l queries \n 
a reference time period, e. g., a day. But providing SDl 
and indexing services— not to speak of reproduced mag- 
netic tapes and punched cards dissemination -~ is at odds 
with the needs of a centralized IR service because many 
potential cvistomers of the IR service are drained off by 
making sevarches by themselves in their own files and in- 
dexes. The designer of any information system is thus 
confronted with the dilemma centrah'zed IR or indexing 
service, while primary economy considerations would press 
him into a solution IR and indexing service. 

This is a point where more detailed investigation into 
the interaction between IR and its derived or conjugate 
services is needed. Rough estimates indicate, that a half- 
and-half sharing of the respective duties might be expec- 
ted. Moreover, the conjugate use of the same format (say, 
abstract) for both types of service is not always efficient 
b'ecause it is clear that an abstract made for current awa- 
reness purpose or simply for «news» dissemination cannot 
be adequate several years later when incidentally paged 
off a file in an IR search. 
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R. A. Fairthornti 

(United Kniydum) 

PROBLEMS OF DATA RETRIEVAL AMD 
DEPENDENT TECHNIQUES. 



This paper outlines tlie basic 'problenis underlying Da'la 
Retrieval and the hierarchy of techniques that it sup- 
ports from 'look up' through 'question answering' lo 
'answer inferring' from records or text. Little of what is 
said here is novel. The problems described are being stu- 
died by many workers. Here I attempt only to sort out 
the rather tangled issues involved and to use them 'to put 
some important techniques into perspective. 

A very necessary prelimmary is to separate these va- 
rious teciiniques from the even more various a-divi'ties 
they aim to aid. To omit this is to risk being like those 
who cannot discuss the building of a bookshelf unless 
they bring in and discuss the psychology of authorship, 
the economics and ethics of book publishing, and 'the so- 
cial effects of libraries. All the >c are matters that must be 
discussed, but not whilst constructing the bookshelf. So 
here we assume only tha't, for reasons outside the scope of 
this paper, it has been decided that techniques should be 
available for finding segments of what people have had 
to say, in words or writing or pictures or sounds or other- 
records, a-nd then to make inferences from these segments. 
The mo'tives for these inferences do not affect us directly. 
The modes in which people talk and record and publish 
for various purposes do affect us directly. 

For the same reason we must ignore procedures for 
verifying ivhat people have had to say and of estimating 
confidence in it. Although verification and weighting of 
sta'tements are essential to the proper use of documents, 
to include them as a ipart of data retrieval Vv'ould be cir- 
cular. For verification and confirmation themselves invol- 
ve data retrieval and cognate techniques; ior example, to 
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find ou't what others have had to say about the matters 
retrieved and how rehable they consider the author. Verifi- 
cation by direct experiment - or obscrvaiion of events 
themselves rather than of what people have had 'to say 
about them, lies completely outside the scope of Data- 
Retrieval or, indeed, of Informatics itself. Menus arc not 
'the same as the meals they denote, nor arc libraries to be 
confused with laboratories. The function of libraries in 
particular, and of Informatics in general, is not the pur- 
suit of knowledge as such, but of knowledge of what pe- 
ople have had" to say. In short, Informatics is concerned 
with management of messages, not with their creation 
or application. 

One key task of message management is to isolate so- 
: me Iragment of a text or diagram or record without com- 
pelling the reader to hunt through all of it. Immediate 
examples are the provision of the numerical value of some 
function for a specified mathematical function; of times 
of departure and arrival from, and at various places by 
rail, ship, or air; of a paragraph or sentence rnen'tioning 

■ the name of some person, place, substance or event; of 
the part of a map, or pean or circuit diagram tliat 'con- 
tains some named part or some specified configuration of 
conventional signs; of some verse or lines of poetry that 
correspond to a possibly incomplete or inaccurate recollec- 
tion. 

These are examples of what is called Data Retrieval, 
' an Activity that aims to sa'tisfy the very common requi- 
rement for a part of a text or record as opposed to a re- 

■ quest for the whole of it. The data retrieved is the actual 
segment of the text or record, not what the segment may 
refer to. What one retrieves from a mathematical table, 

. for instance, is not the value of a function, but an expres- 
; sion representing the opinion of the -table maker about 
the value of that function. What one retrieves from an air- 
; line schedule is not as is well known, the actual times of 
: arrival and departure but an expression representing the 
; intentions of the air transport organization concerned. 
What makes the sequence of digits or the alphanume- 
rical expressions cited become 'data' in the sense of Data 
Retrieval is the fact that they are required by themselves 
, In isolation from the rest of the text. What makes texts or 
^ : records 'data' therefore does not depend upon its sym- 



bols, wording, or Icpic, or any inlrinsic charnctoristic, 
but upon liow it is lo ho. used. In linguistic and literary 
research any document may be used as a collection oi 
data for separate retrieval. Indeed, standard works — - 
Shakespeare, the Bible, major poets — afe often arran- 
ged and numbered for Data Retrieval; that is, for looking 
up parts of the document without having to read through 
the document as a whole. 

Data Retrieval, or 'look up' is an essential prelimina- 
ry to all forms of 'question answering' and other inferen- 
tial procedures from records, whether automated o;' not. 
Data Retrieval is necessary for these, though not suffici- 
ent. If you cannot do it, you cannot go on to do the more 
complex operations. If you can do it, you have still to be 
able to carry out the more complex operations. 

Also, though this is often overlooked, you cannot be- 
gin to perform even simple look-up unless you first per- 
form ordinary Document or Record Retrieval. For you 
must first get hold of the appropriate documents from 
which the segments of text are to be ex'tracted. Bui the 
topic of a document as a whole cannot be deduced from 
its textual content in isolation, it depends also upon what 
kind of reader is going to use it for what kind of purpose, . 
Patent Office searches, for instance, try to find prior di- 
sclosures of ideas claimed to be novel, wherever and for 
whatever purposes these ideas may have been described, 
in fact or fiction. Thus in theory any published document 
may inadvertently contain a prior disclosure in the Hg^-t.. 
of retrospective search. For instance, in the USA humou- 
rous and fictional cartoons have been cited as prior dis- 
closures of patentable ideas subsequently claimed as 
novel. 

The patent offices of all countries have to use the com- 
plete armoury of bibliographical weapons to select from 
world literature as a whole, and to lead down through 
classes of documents, through single documents, to the 
sentence paragraph or picture expressing the patentable 
idea. Clearly there can be no complete solution, even if 
we were to read every word and look at every diagram 
and picture ever produced. For we would still be left with 
the problem of recognizing which of these were relevant. 
This usually would involve problems of inference from 
the text or picture in the light of the motives of the sear- 
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cher and of the original au'ilior of the document as a 
whole. 

For the purposes of this paper I will take a 'docu- 
ment' as being any representation of messages that is 
treated as a unit of' discourse in the social environment 
concerned. Clearly what one person regards as a docu- 
ment may be a part of a document or a collection of dif- 
ferent documents to someone else, or to the same person 
on different occasions. For instance, a librarian will re- 
gard the set of volum.es comprising an encyclopaedia as 
being one document, as will an accountant, for if any one 
of its physical component volumes be missing, the encyc- 
lopaedia will be incomplete. An individual reader of the 
encyclopaedia would regard even one volume to be a phy- 
sically bound co'.iection of different documents in which 
the one he wanted to read was embedded. Similarly a set 
of bound issues of a periodical, a single issue, and one of 
the articles in an issue may each be regarded as a docu- 
ment according as to how it is used. 

Because a document is not a fixed unit of discourse 
but varies with how the records are to be used, there is 
no fixed level at which document retrieval ends and data 
retrieval begins. On the other hand, no single application 
of a bibliographical method, such as subject classification, 
can extend unchanged down from the collection as a who- 
le into the ultimate words or signs of the records without 
changes of criteria. This is manifested by the notorious 
differences, by no means fully explored, between indexing 
a collection of books and indexing the contents of a single 
book. The relations between documents considered as 
units, and within a document considered as an ensemble 
of units differ, whatever entity may be chosen to be, for 
the purpose in hand, a document. At root the problem is 
that the topic of a document considered as a whole is not 
necessarily that of the topics of its component parts; e. g.. 
the theme of a novel or poem is not necessarily mentioned 
in any part of the novel or poem; nor is the topic of a 
scientific journal the topic of the several articles that 
compose it. It is only occasionally possible for the topic 
of a sentence, in the sense of what it mentions, to be the 
topic of the paragraph in which it appears, or of the pa- 
ragraph to be that of the chapter in which it occurs. In 
short, we- cannot find out the topic of a document from 
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the topics of its components alone. Even if we had a con- 
cordance such as, say, a keyword-in-context (KWIC) 
index that covered the contents of an entire h'brary as 
il It were a single huge document, this would be useless 
oilier than for purely lexicographic search. A concordance 
negms its work after the document it serves has been pi- 
cked out from other documents by bibliographical tools 
designed to distinguish between documents, not within 
them. Mathematical tables, which are e.xtreme cases of 
data yielding documents illustrate this point. Inspedion 
of the entries in a mathematical table yields the numeri- 
cal values, or finite approximations thereto, of some ma- 
thematical function at cited arguments. No inspection 
analysis or computation based upon these can determine 
what the function is. Any number of polvnomials or 
oiner Junctions can be found to fit any finite set of finite 
numerical values for given arguments (Weirstrass' app- 
roximation theorem). To find what the function is we 
must look to the page headings or the title on the spine 
ot the book or other metatextual expressions. 

Even more do we have to appeal to book, journal, and 
article titles, and the intentions of the reader, when as- 
signing topics to-fragments of text for the purpose of data 
retrieval The. shorter a linguistic expression, and the 
smaller the segment of picture or diagram, the greater 
tlie variety of documents of which it could form a part 
Therefore items to be used for Data Retrieval must first 
be classified in some way or other, pertinent to their in- 
tended use, before data retrieval can commence. In no 
way does data retrieval bypass the normal bibliographic- 
procedures. Indeed it demands even more powerful pro- 
cedures even more powerfully applied. 
^ Documents compiled specially for data retrieval of 
.<nown purpose .by readers of known habi'ts are usually 
designed to be congenial to these habits. The physical 
and visual formats attempt to satisfy the anticipated con- 
ditions of consultation, so far as these complex demands 
can be rnet. On the level of 'human engineering' alone, 
acceptable satisfaction is achieved onlv by unepsy com- 
promise betxyeen weight, size, typeface, layout, binding 
methods, and characteristics of the printed surface. Sho- 
uld the compilation be needed for whole or partial machi- 
ne use, It must be completely redesigned. If it is to be con- 
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suited by many people, each for a single retrieval, it must 
be designed differently than for consultation by few pe- 
ople, each for many retrievals at a time. Again, the de- 
sign will vary as the use involves a sequence of systema- 
tic retrievals, or a set of arbitrary retrievals. 

These physical aspects must reflect more than the 
physical and operational characteristics of the anticipated 
type of user. They must reflect the structure of discourse 
within the topic for which the data items are to be used. 
Layout, headings, titles, and other metatextual devices 
are manifestations of the logical and .Mnantic structure 
^appropriate to the topic discussed, and the discussants, 
if the same items are to be used for another purpose, the- 
ir arrangement and labelling within the document must 
i;e altered accordingly. 

For instance, single items of data that couple a name 
and a telephone number will be differently arranged, 
printed, and indexed for a general telephone directory, a 
directory of trades and occupations, or for an inverted 
directory of names corresponding to numbers. The physi- 
cal characteristics will vary according to whether the 
directory is to be used by the general public in a public 
place, by trained staff in an office, or in conjunction with 
scanning devices. Again, the actual entries in a table of 
Bessel functions of high order are a selection from those 
in a table of Bessel functions of lower order or, in some 
cases, of those in a table of ordinary circular functions. 
The items are just differently labelled. 

Evidently even moderately efficient and convenient 
data retrieval, even for a sharply defined purpose and cli- 
entele, demands detailed design of the data yielding do- 
cuments and of the ways in which they are to be used. 
Also the overall subject classifications and other biblio- 
graphic tools of the documentary environment must be 
aimed at data retrieval for the "declared purpose. Little 
evidence supports the comforting belief that an entire 
collection of documents can be organized in advance on, 
say, magnetic tape so as to allow direct data retrieval 
for any purpose in the future. The best we can do is to 
deploy duplicates of relevant parts of the collection to deal 

.with existing demands for data retrieval, 'to understand 

or discover the principles needea ,or redeployment; and 
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io have the physical and coniputalional resources to do so 
when needed. 

Many docLimonts not habitually used as sources of 
data can be called upon to do so under the demands of 
linguistic, stylistic, historical, and political studies. For 
these the tradi'tional fusion of special format and concep- 
tual structure is not usually feasible. Even in texts, such 
as the plays of Shakespeare, that are often used both for 
data~e. g. quotations out of context, stylistic analysis— 
and for reading as a whole, compromise between format 
and convenience is uneasv. 

All indexing and formalting devices for data retrieval 
aim eventually at locatnig and ex'tracting a fragment of 
text or record, considered at first just as a string of sym- 
bols. The conceptual criteria for selecting fragments are 
built into the index and format. They are arrived at ini- 
tially from people who have read the complete document 
from the viewpoint of those who are going to use it as a 
source of data, e. g. book indexers. Sometimes, as in lin- 
guistic analysis, the text strings as such are the objects 
of interest, and therefore the entire text is to be used as 
data. Documents that are to be used in this way by eno- 
ugh people may be arranged and indexed wM"th respect ■ 
to words and their sequences just as are other data com- 
pilations. Whether this be done or not, at some htage the 
entire 'text must be scanned by some device, human or me- i 
chanical, capable of recognizing, locating, and counting 
expressions of the required types. This task demands much 
of both humans and machines, even when given rules for 
identifying the required expressions. On the physical le- 
vel, recognition of even quite sharply specified patterns, 
such as printed words in a limited range of size and ty- : 
peface, segments of musical notation, chemical formulas, 
conventional signs on maps and circuit diagrams, is by ■ 
no means theoretically established or operationally feasib- ; 
le in general. Not the least problem is the purely mechani- ■ 
cal one of manipulating documents of different sizes, for- 
mats, and bindings. Holographic and related techniques ■ 
'based on generalized Fourier synthesis and analysis can ; 
pick out patterns, wherever they occur, on a plane, provi- ; 
ded that they are not askew, but such methods are in the- ; 
ir infancy. Closely related to these problems of input are i 
those of output-, ^involving synthesis Tor display or -format; ■ 



104 



The major problems lie in the formulation of recogni- 
tion procedures at all levels from pattern to topic, and 
their manifestation as symbol manipulations. The latter 
task belongs to computer scientists and engineers. The 
former does not, and is very formidable. Even should we 
have a device that could read and identify conventional 
. symbols, we have yet to instruct it to recognize those 
combinations that represent proper names, substantives, 
and substantival clauses, amongst others. We need these 
to find out what a document mentions, a necessary but 
not sufficient step towards finding what the document is 
about in the context in which it is to be used. 

Without doubt computational linguistics and symbol 
manipulative devices are helpful tools for data retrieval, 
and are rapidly becoming indispensible ones. But it must 
be remembered that they are tools, they are not substi- 
tutes. 

Data Detrieval for the humanities— pictorial art, mu- 
sic architecture, history, and literature — differ in no way 
from Data Retrieval for science and technology, other than 
in being more difficult. The varieties of representation and 
of discourse are far wider, and they are not so sharply 
constrained and disciplined. In response to these challen- 
ges, the humanities have made and are making important 
advances in linguistic (including pictorial and musical) 
analysis and synthesis, and in their computational reali- 

■ zations. I would urge that Data Retrieval should not be 

■ considered as something different for the sciences than for 
: the humanities, but that that botli should be considered to- 

gether as part of 'the same general activity based on the sa- 
me general principles. Neglect to do this will seriously de- 
^ lay progress and cause much unnecessary duphcaiion of 
thought and effort. . , 

Data retrieval in the sense of 'look up' or look thro- 
: ueh' to find explicit expressions, is the necessary thougli 
^ not sufficient preliminary for more complex activities such 

■ as 'question answering' and 'answer inferring from re- 
: cords Before dealing with these we must first bear m mind 

that any ansv/er we derive from a document or recorded 
message is a purported answer. It may not be the correct 
one Verification lies outside the scope of informatics. 1 ins 
' comment should be obvious; but many contemporary wri- 
Q tings- suggest- that their authors believe,- or behave as il 
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hey believed all factual statenienfs io denofc facts-even 
in !ction-or that we or a computer can lell whether a 

X 01 s when applying informatics to the humanities. We 
nd u here one more argument against keepinjr scientific 
and humanistic applications insulated from each other 

mformatics is considered as -applied to one field only, 
he the principles and concepts belonging to informatics 
in It. own riG it tend to be infected bv alien principles and 
concepts valid only in the field of application 

Secondly, we must bear in mind that if a document is 

biVur^Kl ^ipiKopriate question must be unam^ 

bigiH. ibly as ociated with the appropriate answer One 
p'Zl '"^'V'^^./i^J^^'tion from its answer, because any exp^ 

ZfZn U ' '"''"'^^ '."^ '^'■"^^^^^ °f ^l^^stions. For 
instance the expression '1917' is (he answer to an indefi- 
nite number of chronological questions, and also to an in- 
definhe number o computational queries such as «What 
IS the product of the cube of 71 and the cube of 3^» The 
answer to any computational query, or to anv query 'invol- ' 
vmg a numerical reply, is to be found in an extensive enc 
igh table of randomly generated digits. Any query about 
onnH '\rr '^'^ ^'^P^'-^^"-^ of aircraft sometimes may be 
0^ 1-M ^^^^/'"^^s departure and arrival of other kinds 

1. ^^n'fl '^^^^'^'"^^ ^he^i- Someti- 

mes aigorithmic procedures are available for recognition 

^^hI5^'^'^'^'''^^l'^^=■^" of the answer is usuall^^^ prefe 
ne ■ ir'fc f q"e/y,and for arbitrary matters in ge- 
no rJ; h hl^^^''J° appropriate timetable by 

normal bibliographic procedures, and then to look up the 

laid\^wnf '^''T^ your query as expressed in the mode 
Idid down by custom or-the mstructions for use of the ti- 
metable. From this point of view the wording and format 
0 ocuments designed for data retrieval ('reference ' 
put to them ^^estions that can be : 

Often such data compilations can be used to answer ' 

questions not explicitly catered for by the designers dTo ^ 
vided that we have valid procedures L this nd th^t^we 

0 not attempt to manufacture data that is not al e dy ir^ ' 

Plicit m the compilation. A. common example is o use a ' 



table or schedule inversely io find which argument corres- 
ponds to a given value, or which name or title to ii given 
serial number, Whetner this is more convenient than to 
use a ready made inverted compilation of the same data 
depends upon many things, including the existence and 
accessibility of an inverted compilation, and how systema- 
tic the data may be. In the days of manual desk calacula 
tion that involved looking up of values in many different 
tables, experienced computors usually found it more con- 
venient and accurate to use a direct table inversely than to 
switch over to a separate inverted table. They did not 
lake this view when trying to use a telephone directory 
inversely. 

In general any document designed for data retrieval 
and the answering of a particular set of questions can be 
used legitimately, if not conveniently, for certain types of 
questions not catered for originally. A document so used 
will be called 'answer providing' and the process of using 
it in this way as 'answer deducing', Many highly systema- 
tic documents provide aids for both direct look up of pre- 

■ recorded answers and for deduction of answers not expli- 
citly recorded. Mathematical tables usually provide for in- 

■ lerpolation between entries, and the amount of instruction 
and printed aids for interpolation will vary with the re- 

■ sources assumed to be at the user's disposal, Present day 
tables for human use are different from those published in 

' desk calculator days, and those for automatic machine use 
very different indeed. Tomorrow's tables and reference do- 
• cuments for humans aided by computer networks must al- 

■ so differ from those of today, and those data compilations 
that are to be used by the computers as an implicit part of 
explicit instructions will have to be designed with very ca- 
reful attention to the type of public they serve. They cannot 
serve all requirements equally well, any more than can a 
printed reference book. 

Only very systematic matters completely closed under 
algorithmic procedures are valid topics for answer deduc- 
tion. By 'closed' is meant that the symbols used not only 

■ represent real entities, in the field before calculation and 
symbol manipulation, but also afterwards. Clearly it wo- 
uld be incautious to interpolate an airline schedule bet- 
ween the entries for successive departure times, but. it 
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would be reasonable to make rough interpolations between 
the time of departure as recorded and the time of arrival 
at its destination. 

Though the fields to which answer-deduction is valid 
are very restricted in scope, they are very important; for 
example, air traffic control, shipping and cargo deploy- 
ment, communication satellites and spacecraft, logistics of 
nursing and hospital management, civic administration of 
well defined activities, manufacturing processes, milita- 
ry applications and, of course, the logistics and manage- 
ment of libraries. Each such application is highly speciali- 
zed and its design demands extremely careful and acute 
study of the samantics of the situation by subject specia- 
lists, of the design and assignment of labels for the topics 
and structure of discourse by documentation specialists of 
he representation and organization of the labelled items 
by computer specialists, and the physical and operational 
integrity of the system by engineering specialists. The user 
also must make some effort to use this tool properly as 
lie must for any tool from a screwdriver to a computer net- 
work or, for that matter, a library. 

Though strict deduction and calculation of answers are 
valid only within very narrowly defined and completely un- 
derstood applications, these are not the only valid forms 
ot inference. Few conclusions drawn in ordinary life are 
as strong as those drawn from deduction or calculation 
However, weak inferences are by no means useless It is 
better to have a weak conclusion correct as far as it goes 
than to have a strong but incorrect conclusion, correctlv 
deduced from false or incomplete premises. Most services 
rendered by libraries and cognate agencies are necessarily 
based on weak inferences. Not being omniscient and omni- 
potent, and not being identical with the readers who will 
use the services, they aim only to provide documents, re- 
cords, and data that are plausible responses to what 
their customers ask for. If asked for the answer to an 
explicit question they seek this by direct data retrieval or 
by deduction or calculation from explicitly answer provi- 
ding records. These may not exist. In this case, the servi- 
ces must seek to supply data, in the sense of fragments of 
or cita ions from records, from which it is plausible to as- 
sume that and answer can be inferred. This demands know- 
ledge of the background knowledge of the user in this par^ 
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licular case. \Vc must note that this Is by no means the 
same as possesing this background knowledge itself, as 
possessed by a practitioner within the topic concerned. Nor 
is it, nor can it be, the knowledge possessed by the indivi- 
dual making the request. What it is 'book knowledge'; that, 
is, familiarity with the modes of discourse enough to re- 
cognize passages of text from which tlie expert reader 
might infer the answer he seeks. There is no certainty that 
he can do so, or that he will do, so. The procedures aim to 
makt it reasonable to assume that he may do so, 

It is well known that this limited aim is not only of- 
ten achieved, but also is well worth achieving. A trained 
librarian, or services designed by him, who can indicate do- 
cuments or data that probably will interest or inform 
: particular readers does more than just save them time and 
. trouble. He may trigger their imagimation and curiosity. 
The problem hers is how to delegate such 'inferential data 
retrieval' or 'answer inference' to less trained agents or to 

■ automate wholly or in part, or to make it possible for a 

■ trained librarian to do more of it for more people in a gi- 
ven time with given effort. 

In special case it may be good enough to search pas- 
sages of text or parts of pictorial records for the presence 
; of particular names or symbols. To make such search 
worthwhile, it must be more than good enough, unless very 
weighty matters are involved, or the users are prepared 
to submit the output to very refined examination or the 
topic and mode of discourse are so narrow that readers' 
interests are adequately indicated by the occurrence of 
particular words or symbols. Searching of records, by hu- 
mans or machines, is a difficult and expensive operation. 

Even if complete searches of text are feasible, search 
and selection for inferential data would still demand know- 
ledge of the structure and language of discourse of the 
potential reader, of the author of the text, and of how to 
correlate the two. But these are also necessary for the ef- 
ficient design and application of documents made for exp- 
licit data retrieval. Also, of course, they essential for sub- 
ject classification, 'content analysis' and other operations 
" aiming at document retrieval as opposed to data retri- 
eval. , , lU J 

The two are not in fact opposed, one deals with aocu- 
o ments as a whole, the other as with fragments of docu- 



men s. But we have seen earlier that the notion of 'doru- 
ment is relative to use, not an absolute property of an iso- 
lated i)locl< of recorded discourse, The differences between 
document and data retrieval arise only from different choi- 
ces of unit of discourse or record, From the point of view of 
question answering in the wide sense, they are both tools 
ot Reference Librarianship in the wide sense. They musf 
not be regarded as distinct, but as governed by the same 
pnnciples, and as liable to merge one into the other wlie- 
.ther the records used are mathematical tables, m'ui -al 
scores, maps, sound recordings, or novels. 

From the physical and engineering points of view the 
differences between dealing wiili an entire I'brary or with 
a fragment of text are considerable, TraUocriptions and 
formats must be designed for particular needs, they cannot 
satisly ail of them at once, vlutomation for data retrieval 
mi lally should be designed as the personal tool of a trai- 
ned person, corresponding to the traditional Reference 
Librarian, acting as a delegate of the readers. Automation 
01 data retrieval and its derivatives for the direct use of 
readers, as in a multiplcA- computer network, entails auto- 
mation of the reference librarian as well. 

The root problems of Data Retrieval remain those of 
Document Retrieval, both in the bibliographic (descripti- 
ve) and semantic (topic, subject) aspects. We have to find 
how much social and linguistic experience of various ac- 
ivities must be built into the system, and also to find how 
to build it in. These problems are common to applications 
in both the sciences and in the humanities. 
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A CONDENSED HEURISTIC DOCUMENTATION 

SYSTEM 

1, Introduction 

The scientific world is currently alarmed by the exis- 
ting exponential rate of increase in documentation. Conse- 
quently, a system which would allow condensation of nu- 
merous scientific data, literature and experimental know- 
ledge in one complete treatment is of undeniable interest. 
If such a system also holds the possibility of suggestion 
of ideas and even of organisation of research, then one ob- 
tains a process which constitutes a serious step towards 
a solution -of this major problem confronting scientists. It 
is this process which we present under the name of Con- 
densed Heuristic Documentation (C. II. D.) 

Cur system has perhaps one drawback in that it as- 
sumes a good analysis of publications and a precision of 
reasoning. This inevitable obstacle of the actual form of 
presentation of scientific texts could be overcome in the 
future if scientists were to agree on a more consistent use 
of words in a more precise language. Science must find 
its language and, until it does, researchers will have to 
rely on specialists in documentation to perform a part of 
their work. 

The system proposed is based partly on a generalised 
logic described by one of the authors [1] and partly on a 
consideration of couples of concepts and the transition 
from couples to series, two ideas which are fundamental 
to the human mind and which have been fully described 
by the psychologists Henri Wallon and Jean Piagct [2]. 
The idea of couples is also convenient since it allows a 
simple representation in 2 dimensions and the use of clas- 
sic perforated card indices. 

2. Coupling of concepts 

Our method is independant of the choice of concepts 
which must only obey the rules of their own particular 
science. 

ill 



Concopls iirc considered first of all by couples, each dc)- 
Uple of concepts being connected by one or more coup- 
ling numbers wbicli express in a conventional way, all or 
part of the following indices: 

reltaion existing between the concepts considered, 

plausibility of the relation, 

weight of the 2 preceding indices, and bibliogrnphical 
reference 

Each of these terms will be defined in paragraph 3. 

The coupling number can be written in any svsteni, 
^vhether decimal, dual or other, the transition from one sy- 
stem to another b?ing classic. A number described in the 
decimal system can therefore be converted to a number 
of base 2 with a view to its use in a computer programme. 

A possible example of a coupling number in the deci- 
mal system is: 

5412 / 31 / 07 / 45 

Number of the biblio- weight plaiisibi- relntion 
graphical reference lity 

Such an expression is valid only if the same number of 
figures always refers to the same classification of terms 
and if codes for relation and plausibility are established. 

In the example given, the 2 figures on the right exp- 
ress the relation and the next 2 figures to their" left, the 
plausibility. 

If one is not interested in the weight but can limit 
oneself to plausibility, the 2 corresponding figures can be 
omitted, thus simplifying the expression, Similarly, if one 
does not wish to retain the references, one omits the fi- 
gures on the extreme left, 

3. Definition and calcultaion of indices 

a) Relation is the nature of the logical link existing 
between two concepts, There are several types of 
relation, of which it is customary to consider first 
inclusion. This is represented in most classic logics 
by the symboic^^ where the expression AcB cor- 
responds to 
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H AcB and BcA we can write A=B, giving us 
the relation of equality. .. 

Implication is usually represented by the symbol-^ 
-> where A->-B means that A implies or leads to B. 
If A-^B and B-^A, then \vc can say that A is equi- 
valent to B. 

Further, if A-^B and C-^B then A is analogous to 
C. One can, of course, imagine other relations and 
conditions as well as those defined, a further 
example being the relation in experimental science 
expressed by the words «has a value of», which al- 
lows the coupling of a general concept with its me- 
asured value. 

b) Plausibility is the value attributed to a judgement. 
As shown elsewhere (4), plausibility can not be con- 
sidered as a probability, since it does not satisfy 

the condition pA+pA= 1. 

The following discontinuous scale of plausibility 
is employed: 

b) 

-1 -0,9 -0.7 -0.4 -0.2 

False very probab- probably liypothctlcMl- very hypotheti- 

iy false . false !y false caliy false 

0 

uncertain 

+ 0.2 +0.4 +0.7 +0.9 +! 

Very'liypothcti- hypothetically probably very probab- certain 
cally certain " certain " certain ly certain 

Plausibility can be determined according to the fol- 
lowing rules which may take the form of a definition, or 
a convention translating as closely as possible a line of 
thought, or a compromise with the theory of probability, 
and which, all together, must lead to a consistent theory: 
1. If the same relation between two concepts is ex- 
pressed bu different plausibilities and weights, one 
can condense the information in a single coupling 

number which has a plausibility value of —. 
where is the weight of the various individual 
coupling numbers and Pi the corresponding plausi- 
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bility. In this case, we have a definition of plaiisl- 
iDility which we treat as a means of measurement. 

2. l! A is analogous to B, and B is certain or very 
prol}al)ly certain, then we can say tlial A is very 
liypotlietically certain. I-Ierc llie rule !al<es tlie form 
of a convention. 

3. If A is analogous to B and to C and bolli B and C 
are certain or very probal:)ly certain, tlicn A is hy- 
potlietically certain. 

4. Rules 2 and 3 are valid if the word «certain» is re- 
placed in each case by «falsc». 

1'' ]\ \" 

5. If A->B and B->C, then A^C i. e. tlie plau- 

£ Q_ 

sibility of the final relation is the product of the 
individual plausibilities of the two initial relations. 
This is a convention based on the transitivity of 
tlie relations — implications, inclusion and equa- 
lity. 

6. If A-. B and AcC, to C->B where P' will be 
very hypothetical for any level of plausibility, p. 
This is a convention of generalisation. 

7. Concepts which are equal or equivalent have the 
same level of plausibility. 

8. 'Students' rule oft' (3) is compatible with our dis- 
continuous scale and our definition of plausibility 
for t<1.5, as has already been established in a 
previous work by one of the authors (4). This rule 
involves a compromise with the probability theory, 
and can be used to calculate the plausibility of 
quantitative observations. 

9. The plausibility of a qualitative observation must 
be assessed by the scientist in accordance with the 
usual criteria and will normally be «probably cer- 
tnin» or «very -probably certain». 

There are other minor rules governing the determination 
of plausibility (4) but those given here are the most im- 
portant. 

c) The weight of a judgement expressed on a relation 
between two concepts is the value of this judge- 
ment with respect to all previous judgements on 
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the same relation between these two concepts. 
In practice the same weight (i. e, 1) is often given 
to each judgement of the same relation, and if the 
plausibility figure which appears in the final coup- 
ling number results from, say, 10 judgements, the 
weight of this plausibility, pi,' will be 10. A single 
sew jiidg-ement of plausibility, \h, will have a we- 
ight of l,ibut on adding it to the system, wc obtain 
a final weight of H and a corresponding plausibi- 

Itity of ^^1^^]^^^ (rule 1 § 3b) 

d) Jha referenice is thai used classically in documen- 
lation for easy retrievat of the document which has 
been used as the basis for writing the coupling 
number. It ts often the actual number of the docu- 
ment in the chosen system. 

4. Trainsition from couples to series 

The transition from couples to a well-ordered series 
of concepts must first satisfy the rule of transitivity: 

AcB and BcC leads to AcBcC 

It mu5l then satisfy the rule of coherence: 

If AcB we can say B^A 

The se iation of concepts from couples connected by the 
same trsnsitivc relation is a perfectly mechanisable ope- 
ration. This can be illustrated by the following easily ge- 
neraifeable example: 
Starting from 

A->B(1) 
.B->C (2) 
A-> D (3) 
•D-B (4) 

we can write 

from (1) A...-:-B,.. (5) 
from (2.1 iii..-^Cn. (6) 
from A...->Di.. (7) 
from (-) D,..->B„, (8) 
and from ((6) we can dc-iuce: 

'fe..-Cni (9) 



8* 
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and hence finally 

A...->Ui..->B|i.-vC 

Wc could just as easily liave written: 

A. ..-^B.., (5) 

B. ,.-->C.„ (6) 

(7) 

D..,->B.„ (8) 
and B.,i->Ci,i (9) 
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which arrives at the same. result. 
We have very simpiy allocated indices to the concepts, 
albeit as basically as possible, but in such a way that the 
order existing between any two concepts is indicated by 
. the indices. 

Where concepts are connected by different relations, or 
by equivalent but intransitive relations, the problem be- 
comes more complicated. It is often preferable in this ca- 
se to replace seriation by a matrical representation, 
which can then be easily converted to a linear form, A 
simple example of this process is the following: 



the following: 



D 



B 
C 



corresponds to 



■ BcA 

and B->C->D 
p' 

from which we can deduce: A->C->D (rule 6 in §3a) 



5. Operations on concepfs 

The usual operations to be considered on the concepts 
are logical and mathematical. The mathematical opera- 
tions are well known and will not be discussed here. The 
logical operations are union, intersection and abstraction 
and satisfy the following definitions: 
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Union of and B^^^AW 

.-P^"^o^ '(for A aiid.B luiviiig the 
~ " same weight— rule 1 §3b) 
Where E is the union of iiie intersections and may h\ 
represented pictorially as follows: 

Intersection of A^ and P^'^kW. 

--EP' (for P'<P) 
E rcprcseiils the intersection of tlie exten- 
sions 



Abstraction of A'' and A'^'^A^^BP' 

-EP (for P'<P) 
E = intersectioii of tlie comprehensions and 
union of the extensions, 

It should l)e noted that the extension of a concept refers 
to the number of oljjects which the concept signifies and 
the comprehension refers to the qualities of the concept. 
Using the system described for the representation of con- 
cepts by couples, new couples can be formed starting 
from a scries of couples, for example: 

Acq gives us (AUB)cC 

BcCi and (ARBjcC 
New concepts created by combinations of initial con- 
cepts are designated by a word or figure*. The retrieval 
of these combinations may be simplified by constituting 
perforated card indices (or auxiliary memories) which gi- 
ve a list of the various combinations carried out on the 
initial concepts. 

In certain cases, it is necessary to establish a list of 
categories for the combinations made. It is difficult to 
imagine, for example, a complete list of carbon com- 
pounds. Here, one would have to consider first the main 
classes of products or some other criterion. 
Conversely, an initial concept can be resolved into 
its elemental concepts which may be studied in the same- 

* or a combination of letters and figures. 
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way as combinations. In this case, the elemental concepts 
or combinations of these elemental concepts can be de- 
duced from an analysis of the initial concepts. 

6. Calcultations on coupling numbers 

■We have shown how conplinfj numbers can be calcu- 
lated for.onc couple of concepts. SevenI coupling num- 
bers of similar relation can be replaced by a single num- 
ber having the same relation but a plausibility calculated 
by the formula already given: Different rela- 

tions for the same couple can be combined according to 
certain rules. 

As we hove seen AcB and BcA leads to A = B, 

A-;-BnndB->A tells us A is equivalent 
to B, 

A->BandC->B tells us A and C. are 
analogous, 

A->B and AcC gives C->B, 
and A->B~>C gives A->C. 
^ — ■ ^ 
Combinations arc logical or mathematical operations. 



7. Scientific reasoning 

We have already seen some aspects of scientific rea- 
soning in analogy and generalisation. Two others which 
deserve a special mention are the formulation of hypo- 
these and the «game theory». 

A hypothesis . results from a combination of factors 
which are each obtained by analogy, the combination it- 
self also being an analogy. A simple example is the case 
of an unknown product, X, giving the decomposition pro- 
ducts, A, B, and C: 

X-B 

x->c 

If we know the factors, M and N, which give the same 
decomposition products, we can deduce that X is analo- 
gous to M and to N. Further, if M and N have in common 
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(bv abstraction) a structure, S, we can say that X has a 
plausibility, p, of having the structure, S. This can be 
represented in malrical torm in the following way: 

^ 3 C M N (class of slrucUircs, S) 



X 
A\ 
N 

MaN 

The game theory has been applied in scientific re- 
search for the organisation of experiments. Its matrical 
representation is classic, being that of the hypotheses, 
Hi, H2 and H3 and the corresponding optimal experi- 
mental plans, pi,, piz and pb, which may be the result of 
an analogy or a statistical rule. This is written m part 
by the general matrix: 





a 


b 


c 




-> 










— > 




H,npi.i 






-> 


H,npi2 


— > 







where a, b, c, etc., arc the results of application of the va- 
rious combinations of hypotheses Hi, H2 and H3, and 
plans, pl|, pl2 and Ipa. The choice of the best plan is made 
according to the usual rules of the game theory. 

8. Example 

We have chosen a simple example of application in a 
field which is familiar to us. The coupling numbers are 
formed as follows: 

1. The 2 figures on the extreme right express the re- 
lation according to the following conventions: 

01--> 
02=^ 

03 = equivalence 
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_> _> ~y 
-> -> 
-> -> 



analogous analogous 



CP 
C 



04-c 
05 =D 
06==equality 
07 = abstraction 
• 08=-«hcis a value o[» 
09 = analogous 
and so on 

2. The third and' fourth figures from the right express 
the plausibility according to the following conven- 
tions: 



01- 


-1 


02 = 


-0.9 


03 = 


-0.7 


04 = 


-0.4 


05 = 


-0.2 


06 = 


0 


07 = 


+ 0.2 


08 = 


-f 0.4 


09 = 


-fO.7 


10 = 


+ 0.9 


11 = 


+ 1 



3. The fifth and si.xtli figures from the right express 
the weight which will always be less than 99. 

4. The reference, on the extreme left, is the number of 
the document consulted or the number given to the 
observation or to the e.xperiment if these are classed 
in the same way as documents. 

The following matrix is obtained: 







So 


SeOj 


s 

SflcaloriesnO, 
Sencaloriesnd 


102.02.09.01 


lOi.10.09.09 

1 


103.01.07.0! 
104.04.09.0! 
105. 10.09. 01 

106.15.09.0! 



I 



ERIC 



where reJerence number 101 

102 - 

103- 
104- 
105 - 

106-= 



number ol Ihe clo:umcnt 
consulted 

number of the experiuienl 
giving this result 
the reasoning used 
the experiment 
further, very careful expe- 
riments 

calculus of coupling num- 
bers 



Mathematical examples, which at first sight may seem 
particularly difficult, can also be treated quite simply by 
this method, in which case the plausibilities are generally 
limited to +1 and —I and the weights generally omitted. 

9. Conclusion 

The process of Condensed Heuristic Documentation 
enables information to be recorded in the form of couples 
of concepts linked by logical bonds whose value is expres- 
sed by a coupling number. It allows the- transition from 
•these couples to series or tables which condense the in- 
formation. This information can then be further condensed 
by calculations on the couplng, numbers- The process al- 
so allows reasoing by analogy, abstraction and generali- 
sation at the same time as deductive reasoning. 

The original concepts may be easily traced. One way 
is to use a perforated card index for each couple and an 
auxiliary memory to give combinations. 
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(Bclgiujii) 

PROPOSAL AND WISH ES FOR AN OPEN 
STRUCTURE IN THE COMMUNICATION OF 
INFORMATION 

Theme of this work 

One can consider with fear tlic present and increasing 
predominance towards the centralization of information 
into more and more powerful knots. This situation may 
imperil the development of specialized information centers 
which, solely, are efficient for maintaining the neces- 
sary dialogue with responsibles of the often very 
narrow fields of human activity. It could become painful 
to keep the good balance between present tendencies, that 
is between centralized concentration and specialization. 

By further improvements of communication we may 
ease the setting up of ^networks and interconnections. If 
these networks possess a; free flowing communication, 
this will implicitely ease ' a decongestion of encumbered 
centralized knots and of course improve their efficiency. 

Centralized information presents an analogy with cen- 
tralized management of a company and, on the other 
hand, the network evokes ideas of dialogue and assign- 
ment of authority. This calls for a fundamental change in 
the structure. The analogy also exists in the results of 
such a change: the new structure leads to decongestion. 

But the communication inside the network can only 
become a free flowing one if the needed ADDRESSES 
arc unequivocally worded and standardized. These addres- 
ses are examined with some detail. 

Information parts in two kinds of functions, i. e. the 
«treatment» and the «communication». However these 
two functions are very often under a one-headed res- 
ponsibility. 

The setting up of the network needs a centralized res- 
ponsibility of the communication and independance from 
the responsabilities of treatment. But on the other hand, 
the last ones will earn to be decentralized. 
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A high level of the quality of communication can trans- 
form information activity into a trade or a business, a 
prodigious dynamic and prosperous one indeed. 

«Ilo\v Id imagine nnswors wliicli 
solve and stinplify? These 
yro prt)l)le.ms to " whieli a refloclioii 
upon congeslion may help*. 

!. p. IG] 

I. Looking backward: the intellectucils private library 

Lets return in time to the period when each intellec- 
tual patiently created his own library, as a real external 
memory connected «on line» to his brain., the only compu- 
ter then available. 

The efficiency of such a private library rested upon 
particularities which eased comniunication, particulariti- 
es even more significant as we no longer succeed in 
using these same advantages. Here arc the characteris- 
ticsr 

ADDRESS OF THE USER is solely: the proprietor's 
own. 

—ADDRESS of the DEPOSITORY: is also one only, 
these being a concentration of books in one single place. 

— These books were selected with care over years and are 
well known to the proprietor. Memorised knowledge of 
BIBLIOGRAPHIC ADDRESSES is good, even if use of 
an index is a pretty good thing. 

— Content of these books has been mastered thoroughly. 
The photographic memory helps, marginal notes, remarks 
and underlining with pencil are warning devices. The 
brain of the creator of this library disposes of fulgura- 
ting means of access to information which concerns the 
INTELLECTUAL ADDRESS specified by the terms (actu- 
ally we say keywords or concepts) of the given question. 

— ADDRESS of the SPECIALIST well versed, whose 
judgment serves in orienting readings and researches, 
also is the proprietor's one. 

Such a library is a device enjoying free flowing com- 
munication, assured by well mastered ADDRESSES. Their 
interjoinings seem so natural, reflex and implicit that 
one needs a careful analysis to persuade itself that these 
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five types of addresses have each one their role in rea- 
ching; the result: a well oiled operation. 

A modern information centre established for the treat- 
ment of large collections seldom succeeds in attairiincr 
such a privileged position. 



2. Babelism of systems 

Congestion jeopardizes today the former and comfor- 
table situation. Every man in charge of a responsobilily at 
any place in the information chain knows that his me- 
mory and his judgment are tools requested for helpinf^ 
the. work of any «information system». And he also 
knows that such human components become less efficient 
when the size of collections increases. Thaz man surren- 
ders his role to systems, and everything rests upon the 
conception of these. 

Recently, in April 1969, editors of the French publica- 
tion «Prospective», Paris, published a booklet devo'ted to 
«L Homme encombre», the encumbered man fl]. One 
chapter by Pierre Masse gives three remedies for the ore- 
sent congestion of human life: elimination, ogranization 
and invenifion. The present paper makes proposals that 
may enhance organi'zation.) 

Which are the causes of information congestion? The 
mam one is of course the terrific growing of the bulk 
of published works. But an unexpected obstacle came 
from the autonomy of each link by which we have speci- 
alized inside the information chain, for instance: publi- 
shers, booksellers, librarians, responsibles of any infor- 
mation centre and also consulting engineers- in any 
fields. Each of these activities are free from the others- 
the man in charge of one is not paid to organize free 
flowing communication between them, he does not care 
for that as a chief responsibility. Each one will take 
care of communication inside his own house but will 
mvite outsiders to adapt themself to his own 'communi- 
cation state. The author acknowledges that, likewise 
other people m charge of a centre, he was obliged to 
organize his own autonomous ways of communication, 
but he ..still deplores such a «babelism» . of . private 
systems. ^ 



124 



J'rom that moment and as a result of the «babelisni» 
of systems, the growth crisis becomes a communication 
crisis: general communication becomes intricate, slows 
down, may even stop. 

New necessities are facing the inadequacy of present 
structures. An information centre which could be compe- 
tent in any field is an obsolete pattern. The new pattern 
we need involves to help one another inside a network 
of centres where the most qualified one has the privilege 
to answer. All that supposes a free flowing commi ' 
lion which the autonomy of systems does not and c ..i 
not allow. That situation gives birth to a conflict betwe- 
en old and new patterns. -One have to frame a new struc- 
ture. 



3. Obsoleteness of the present structures 

One of the main duties for anybody in charge of an 
information centre is to choose, work out and uphold 
; for his own use the most nicely fitted system of treat- 
ment. Here «treatment» means also «communication», 
and in such a way these systems radiate as feudalism: 
each system possesses his fief, inside which communica- 
tion may be a good one, but between the fiefs, that me- 
• ans between systems, communication earns what earns 
compatibility of systems, often not very much or nothing 
at all. Constraint to «translate» one system into another 
is prevailing, need of a «language» understandable by 
everybody in the chain, more, a language convenient for 
the computer, become an obvious request. 

But the each day reality may be worst. Non compa- 
tibility of systems kills mutual acquaintance. Each docu- 
ment has to be registered in each system, bringing evi- 
dent redundancy of efforts. In spite of a natural willing- 
ness mutual help is inefficient. Duplication of treatments 
is a heavy loss of efficiency, but nevertheless it is eas- 
ier and less costly than to translate from one system 
into another. 

i An information centre may no more pretend to be 
; omniscient or omnicognisant. To help one aaiother thanks 
; to a free flowing, communicatiion between complementary 
i activities became the requisite: for achievement. That in- 
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volves coinpaiibilily of systems and setting up of com- 
munication rules. 

To work out and uphold such a yc-neral communica- 
tion urges that a prevailing authority becomes respon- 
sible for it, which means an authority' above feudality of 
systems. Tlie autonomy of anybody in charge of a centre, 
that is his right to choose his own system, requests an 
accomodation. It is not easy to replace loneliness of the 
centre inside .this fief by an open «one-another-lielp». 
That needs a change of structure which may be under- 
stood as some loss of autonomy. Nevertheless any res- 
ponsible of one «iink» inside the information chain ought 
to accept such a reduction of freedom, but, on the op- 
posite side and thanks to standardized ADDRESSES, 
efficiency of communication would be enhanced. From 
that uionient automatic (and already living) equipment's 
would take up a prodigious mass of tedious, small and 
tiresome jobs which are still hiinutely done by men with 
unnumerable amounts of redundancy in the world. So 
we see now that: babelism of systems becomes the de- 
stroyer of systems. Upon such terms we may propose 
a better understanding of all the information activities. 
These are everywhere parked in two distinct functions, 
one of TREATMENT, and the other of COMMUNICA- 
TION. The first is the «message», the second is the 
«transfer», but we speak so easily by using words in me- 
tonymy that we confuse the message with its communi- 
cation. Nevertheless such parting appears quite often in 
the reality of our structures and of our responsabilities. 
Wireless or phone is a process for transfer of messages. 
Bank account numbers arc addresses for communication, 
but responsability of transfer and responsability of manage- 
ment are different: the first belongs to the banker The 
sacond to the titular of the account. The road traffic code 
concerns communication, but neither, the vehicle nor the 
transported wares. The use of postal sectors numbers 
makes our mailings Gjuicker and safer, but that does not 
modify the «message>,. 

We may lengiiten such a list at random, the list .of 
human activities parited between one responsability of 
TREATMENT and owe of COMMUNICATION, and ha- 
ving done so with success and for better efficiency. It 
is quite amazing thai!;, at the present age of the compu- 
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ier, information transfer is probably alone not to have 
chosen such a twin-headed structure. 

COMMUNICATION appears everywhere with the 
character of a collective or centralized responsability or 
authority and, if that is of freneral acknowledgment, 
TRFATMENT may be dccentrniized without any rusl< to 
become inefficient: Such structure is a /ombination of 
a central and collective organization of COMMUNICA- 
TION with a functional de-centrali/.ation of IRhAl- 
MIiNT in an open network. 

Urgency of such a structure is born as a consequence 
of growth of information activities: we needed to diversi- 
ty and we have ramified in the numerous «links» forming 
the «chain» of information, but with an effect to slow 
down communication. That evolution, complexifying with 
functional diversification is similar with the evolution 
of management within a company. In both cases the ne- 
: cessity of the dialogue appears, the necessity of a better 
: «one-another-help», and the one of assignment of 
authority and responsability. That needs, better commu- 
nication, networks and informal interconnexions. «Bet- 
ween the private man and the community, relations will 
become more and more numerous, and we need a cyber- 
netic mechanism for a better fitting up»='='. These cyber- 
nelic mechanisms will use codes, as the road trafic code 
does at the -level of the community. Information transfer 
may not escape such a natural law, a law thanks to 

■ which a new structure of communication is needed when 
treatment complexifies. Till now' transfer of information 
does not care of that. But such an ac^^lf^^f '^1'^^,^°"; 
dition for a more free and more efficient rREATA'\LN 1 o. 
information, a treatment then becoming able to trans- 
form itself in a prodigious industry and trade. 

Efficiency of such an organization for a free flowing 
communication requests compatibility and coherence of 
the ADDRESSES. 

It is not easy to make out such addresses for, indeed, 
the target is to promote a general use them. The basic 
■. terms for such a drawing out will be: 

MV^nslaiocI Worn fl. p. «Ent.re I'lndividu ot la collecUvjte 
les relations seront de plus en plus nombreuses, et il ust necessai-re 

■ qu'un veritable mecanisme cybcrneUque vienne a^ssurer la bonne arti- 
O culalion». 
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~ To draw the' functions io be filled by these addresses. 
■— To create procedures which may ease the use of them. 
~ To choose each time arrangements convenient not on- 
ly for the human user but also for usage by equipments 
without requiring a translation. 

— To try to make the addresses mnemotechnical. 

■— To do so that the addresses may be used for mailing 
when useful. 

A timetable of approach for that problem may be: 

— To review what is already done, or presently under 
project. 

— To describe the different jobs to be made. 

— To give a priority order for the different practical 
stages. 

— To define the rcsponsibles of the communication's co- 
de project and maintenance, and to define the authority 
to head that organization. 

— To realize the project in the frame of a pattern inside 
which speed and efficiency of communication prevail. 

in Chapter (I) we looked at communication inside 
the intellectuals private library. That led us to the defi- 
nition of FIVE addresses having each a functional role. 
Our capacity to organize an international network wor- 
king with a free flowing communication depends on — 
or, is similar to — our capacity to enlarge the use of these 
FIVE addresses up to the international level. 

Our target is to transpose at international level the 
terms that make efficient the use of a private library 
by his owner. 

4. The addresses which requested 
at international level 

We will analyse each type of address one after the 
other. In each case we will try to discover the procedure 
going to the most extensive coordination and to the most 
easy-to-use standardization. But standardization is not 
compulsion. We may gain a better acceptance if these add- 
resses respond to natural and simple rules, quite unequi- 
vocal rules. If we do not proceed that way the standardi- 
zation may become unpractical, going easily to failure. 



128 



The most significant nddrcssGS arc the INTELLECTU- 
AL and the BIBLIOGRAPHICAL ones. They typify res- 
pectively tiic responsibility of the information scientist 
and of the librarian. We will analyse both before looking 
at the otiiers, which are more similar to simple mailing 
addresses. 

Ifiielleciiial address 

This intellectual address — or intellectual destination — 
of each document is usually set up by a list of «concepts» 
which are controlled by a thesaurus. Those thesauri inc- 
• lude generally an enormous part of quite specific words of 
which the use and tlie meaning are out of reach for most 
people. They are words for specialists of narrow fields. 
Stability of such words may be short in duration and also 
in precise meaning. Such practice to use specific words 
arises from our academic time and is going on further 
; with our professional and specialist activities. We lose, if 
; never we get it, our taste for synthesis. Onr above-all 
: analytic education is a real handicap for the treatment of 
\ information. We incline to omit use of generic concepts 
which may specify the character of a document. Our 

■ mind does not organize easily the natural linkage between 
: generic and specific words. 

' Cur first remark is related to the use of specific voca- 

■ bularies. They ought to be left for use by acknowledged 
people in touch with each specific field. In view of an in- 

■ formation retrieval target it is most often useless to defi- 
ne with the most specific concepts wdiat may be called for 
with trivial words by non-specialized people. We realize, 
and thereof the information scientist is very conscious, 
that documents are to be «indexed» with concepts chosen 

' at several levels, which means concepts covering the mea- 
ning of words fanlike from generic to specific. Specialized 
information centres 'take place at the specific end of that 
fan, and centres which receive an encyclopaedical or omni- 
scient mission reach the generic end. The intellectual com- 
: munication in such a large network of information centres 
: belongs to special linkages that have to be realized by an 
; unequivocal generic-specific structure of thesauri. A deep 
; understanding of these relationships is basic for informa- 
tion communKiation. 
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A second rcMiiark concerns comnuinicalion between spe- 
cicilized fields of different kinds. When a specialist escapes 
in such and other field of knowleage he uses simple words, 
not the esoteric words of that field, but words of generic 
feature used by a man of extended general education. He 
wishes to receive answers to his questions with words of 
Ihc same feature. The difficulty to communicate between 
specialists of different fields may become terrific bccau.sc 
01 the specificity of words. We suffer a growing need of 
more didactical papers speaking with generic and simpler 
words. 

Communication between different specialized fields may 
not be easily set up by specific words, but more easily by 
words of generic appearance which create links. at a hig- 
her level of ideas. We can conclude that the making of a 
so called «universal thesaurus» is an enterprise without 
any hope, and may be also a useless one, for nobody will 
ever use it up to the very specialized words it will contain, 
, On the other hand, a single thesaurus controlled at the 
level of generic words would be a fundamental objective of 
a different kind, and would be indeed an objective more 
easy to reach. Such a thesaurus would retain two quali- 
ties of exception, to be short and to be stable. That thesau- 
rus responds to the idea of synthesis of knowledge, wdiile 
specialized thesauri have no capacity to emerge above our 
current analytical behaviour, . ■ 

The American committee SATCOM [2] tried to organize, 
information needs in groups supplied by special services 
the so called «NEED GROUP SERV1CES». The SATCOM; 
studied the U. S. A. case and concluded to the existence of 
about a thousand of those groups, which is not a very^ 
high figure. It seems reasonable to think that a worldwide^ 
census will not give a very higher one. The most useful' 
and practical proposal of the SATCOM report was to or- 
ganize feeding of each group by a kind of surveying made 
by special information centers. That idea is sane and we 
ought to make the most of it. We may ramify several le- 
vels downward, what means introducig some logical hie- 
rarchv in the «need group5» list. That last option may be 
compared with the head-categories of the «Engineer-In- 
dex» or of the «€hemical Abstracts*, or with the titles ot; 
the secondary reviews of the VINITI, Moscow. Still better,; 
it seems that the. EXCERPTA MEDICA FOUNDATION,; 
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Amsterdam [3], look more advjiiilage of that same idetl 
with a four steps hjgical classification. The last starts 
from a set nf 33 main categories ramified deeper in a lo- 
gical hierarchy. Presently the whole includes about 3.000 
categories and subcategories. Those categories respond to 
a pragmatic target: to define user's profiles for each do- 
cument. The indexing of a document in one or several ca- 
tegories grants it a generic definition, which is not always 
very easy to do with most of the specialized thesauri nt)w 
at disposal. After that granting of categories, the indexing 
is developed deeper with use of a very large thesaurus. 

The following idea is emerging from the above conside- 
rations. For retrieval's sake the intellectual definition of a 
document requires at least two levels: 

— Belonging to one or several categories. 

— A coordinated indexing which uses concepts of a the- 
saurus. 

We may agree one with the other, and also at worldwi- 
de level, on the definition of these categories. On the other 
hand the most part of the concepts will positively remain 
•under the jurisdiction of specialists: it is out of sanity to 
control specific concepts at a general level. Such a position 
seems to be the one of EXCERPTA MEDICA FOUNDA- 
TION, as indeed the granting of categories is made by ge- 
neralist physicians while indexing by concepts is finally 
under control of the specialist physician who makes the 

Such" partition of the INTELLECTUAL ADDRESS — 
or, intellectual destination — in several steps is of course 
a sane practice. Indeed the «categories» represent what 
is more permanent in meaning and in duration, what is 
also assimilable by the community, and most of all, what 
is necessary to address the documents to acknowledged 
specialists, while — on the other hand — the «corLcepts» 
may require all the science of the specialists to be expSai- 
ned in their meaning and in their evolution. 

The w^ay is open naturally by such partition for an ag- 
reement up to international level. One has first to define 
the «categories» which represent each «need group», next 
one has to ramify these categories in the respect of the 
«profile» of each need group. Such procedure will iiintro- 
duce a lot of more or less generic «concepts», for they 
are used by several «need groups». But one must keep c'le- 
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arly in mind tliiii most specific concepts will never fmd 
a place in the generic tliesaurus: lliey are subjects for spe- 
cialized thesauri. There is a border closed for specific con- 
cepts. 

The information scientists have from all times made a 
partition of their activities in more or less arbitrary cate- 
gories, each of them being an image of a group of users 
or of a particular type of questions. What is new in the 
proposal of this study is to draw such a partition into ca- 
tegories that will become of general accepted use. 

The procedure which is here proposed consists in star- 
ting at generic level, and then in going down with care to 
more specific ones. The usual procedure goes just counter- 
current; one has built numerous specialized thesauri 
apart one from the other, sometimes even indeed with a re- 
al success, but one failed obviously to realize an integra- 
tion of such dissimilar material to more generic leA^els. It 
is the merit of organizations like SATCOM, EXCERP^A 
MEDICA and probably a few others to mitiate the inde- 
xing by the generic end. ■ 

1o use «categories» at the generic end and «concepts» 
bv coordinated indexing at the specific end is the marri- 
age of two procedures which contend for supremacy since 
years Each procedure possesses its trumps. It is more e - 
ficient to use these trumps than to persist in a rivalry at- 
titude Such a combination of both procedures will be a suc- 
cess if we keep clearly in mind the fundamental difference 
between' a «generic category» and a «specitic concept». 
A «cate<yorv» represents a human activity, a ^<need group». 
By contrast one may say that «concepts» are nearly fa- 
cets inside the need groups. , , - , 

One must say that the proposed categories have no- 
thing in common with a decimalization or with an arbit- 
rary or compelling hierarchy as the U. D. C. for instance 
does. 

Bibliogtuphical address 

■ Already a few years ago some people - and anjong 
Ihem the author [4] - attempted to promote the use of the 
«CODEN» code of the A. S. T. M. [5] in order to give a bib- 
liographical address by care and under the responsibil ty 
of The editor. That address would be unequivocal and quite 
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ca-^y tc use by compuicrs, The CID/Euraloin [6] Iried to sti- 
mulate the use of such an address by editors oi l ie nu- 
clear field. For instance the «ISuratoni Bulletins sliH uses 
the coden «EUBU» to define a bibliographical address 
from the moment of publication, the same with <<ATKE» 
for the German periodical «Atomkernenergit?». These at- 
tempts, of which the usefulness was however proved, faileci 
aimost'completciv at that time. However the benefit of sucn 
an address for the man and for the computer is obvious. 
Nevertheless the CODEN was used to draw up addresses 
for the abstracts published by the secondary literature, as 
example by the Biological Abstracts, but one never again 
proposed that editors themselves would give such 
dress under their responsibility from the publication. 1 lai 
would be the sole procedure in order to standardize the 

computer input. 

The idea of matriculation by care of editors followed an- 
other course. American books are matriculated by year m 
a continuous file and that number is printed by the editor: 
that is the «Library of Congress Number» of each book in 
Great Britain, Professor Foster, Dublin, has realized a 
.complete system for matriculation of book by the editor. 
That project is now well known as the ISBN (Internaii- 
onal Standard Book Number). Presently the IFLA (Inter- 
national Federation of Library Associations) works on a 

similar project. 

But the objective is not matriculation of articles publi- 
shed in the periodicals or of books. What the information 
community needs is a bibliographical address lookmg like 
a matriculation and usable by computer, but for any prin- 
ted matters, books, articles of newspapers or of periodicals, 
monographs or annual reports of companies and so on 
What is worth to be published is worth to be retrieved, and 
that means to-day worth to receive a bibliographical ad- 
dress convenient for computers. The support of institutions 
like the A. S. T. M., or the bibravy of Congress, or the 
IFLA is not enough: without international agreements, 
and with out the help of laws specifying the duty of editors 
in the drawing up of bibliographical addresses, we will go 
on working in disorder and encumbrance, for our compu- 
ters are not built to swallow addresses drawn up rmnutely 
by hand even under respect of bibliotheconomy rules. New 
structures are compulsory for efficiency of- new tools. 
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Thai problem, ;is nn answer to disGncumbrancc, is now 
the very urgent one, but one does not feel that it is yet 
taken under consideration as it obviously earns. 

Address of the specialist 

«A pocket memorandum-book, patiently set up during 
years, is a very humble tool which, il it held the ADDRES- 
SES of the more valuable sources, may be much more effi- 
cient than renowned ]ibraries» [7]. These «sources» can be 
the addresses of the more qualified specialists, or the na- 
mes of knowledgcd people who may give pertinent advi- 
ces, such as the bibliographical address of the more sui- 
table article, or the name of the more qualified information 
center. Tlie power of big companies lies partly on tlieir 
relationships with many specialists, which means to pos- 
sess a network of friends leading to information sources. 
At disposal for American citizens the National Technical 
Referral Center adjoined to the Library of Congress fills 
the mission to point out the searcher to the more qualified 
sources or specialists. But we do not think that attempts 
of that kind are already known at international level. 

Mecanographical listings or retrievals by computer can 
ease such detection of these «sources» or of these «speci- 
alists». That requests to define «sources» and «specialists» 
by their profiles. Once more we face the same problem: we 
ought to organize the standardization of ADDRESSES so 
that we can consult listings or computers more freely and 
efficiently. 

Address of depository 

Each year we are more and more aware of published 
documents by abstracts of the secondary literature. The 
main justifications of that practice are obviously a faster 
choosing and the ability of grouping documents under 
documentary profiles. A standard system of «need groups» 
profiles would ease considerably the «feeding» of each in- 
formation center with the material which is necessary for 
its activity. 

But it is not enough to organize the secondary litera- 
ture for a more exhaustive and faster retrieval of answers. 
We ought to find where original texts are at disposal. 
That requests listings establishing a correspondence bet- 
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weeii bibliographical atkircsscs and the addresses of llic 
known depositories. The absence of a standardized and 
mecanoffraphical system of addresses means more manual 
wo?k or^iorc obviously, no listing at all One may under- 
stand why such listings are st.U o-day lacking. Tl ^ L b- 
rary of Congress attempts to update such isting for tht 
deposit of American books in all the national hbraries. The 
matriculation of American books eased cer amly t»ie dr^;- 
whig up of such index. The Belgian Bib lotheque Royale 
Brufsels, made a catalogue of that kind for the deposit of 
any periodicals in public libraries of the coun ry 

One realizes easily the meaning of standardized ad- 
dresses for the makin'g of such an index if these addresses 
are convenient for updating by computer. 

Address of the user 

For the same reasons as above, it is useful that users 
reciuesting for information retrieval may be defined them- 
selves for the equipment by their own mecanographica 
addresses. That will ease the use of automa ic equipment 
like microfilms duplication thanks to automatic readmg o\ 
the name and address of the application , 

It is likely that a larger training with mccanographical 
addresses will lead to discover that other ones may also 
be useful for instance addresses of editors, or addresses 
of booksellers. ADDRESSES of a rnecanographical kind 
are the starting point for disencumbrance of communica- 
tion if of general acceptance. 

Among the five addresses thanks to which we may 
hope a more flowing communication, TWO are more cha- 
racteristic of information activity; of course they a r e Uie 
intellectual and the bibliographical ones. The THREE 
others are much more similar to simple mailing addressesr^ 
They will be more easy to treat, but they have also then- 
role in reaching the result. Tp. order that these last addres- 
ses may be convenient for the computer and for mailing, 
they could be studied in the line of the ZIP CODE. There 
is a being interest to make listings and mailings easier 

■ with the same addresses. 

Decongcslion of communication is urgent. Ihat requ- 

■ ests an answer at the level of the community. What we 
' ought to realize, we repeat here once more what was expo- 
; sed in ch-aptcr 1, is to organize for the benefit of anybody 
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the privileged position of the intellectual who is the owner 
of a private library. More, the use of the computer will in- 
troduce better infallibility than the human memory could. 

5. Present situation and projection towards 
the future: comparison 

Communication of information changed several times 
in slzQ during last decades. In the private library of the in- 
tellectual the communication was inward. The information 
center enlarged the circle but still in a defined way with 
limits in geographical extension and in intellectual profile. 
The information center appears as an autonomous entity. 
Slowly, as liic customers' needs extended, the center had 
to organize a better one-another-help with the exterior. 
But the feudality of «systems» puts the brakes on- harmo- 
nious externa! relations. We cannot meet the rational use 
of modern equipments as the computer, for we never care 
witli standardization of communication addresses. We need 
now to enlarge the circle to international relationships, but 
our internal and local habits imperil conununication in 
such a large network. 

It is useful to describe such situations in the each day 
living of the information center. That comparison is to be 
made between the present state of feudalism of systems 
and a future for which, as we may hope, the human beings 
would have organized communication thanks to standar- 
dized ADDRESSES. Such a list of examples may be leng- 
thened but the four following ones show clearly the way. 
— FIRST. The searcher himself is alone to be qualified 
for choosing what may give good answers to his informa- 
tion needs. But that may compel him to read hundreds or 
even thousands of papers before making a good find. That 
situation leads him to question the more suitable informati- 
on centers. But he knows nothing about the retrieval system: 
used by tliese centers. He ought to explain with many de-^ 
tails what he is searching for, and in which context hisi 
request has been formulated. And he has to give thesei 
explanations to somebody who knows nothing or very little 
about his own activity and target, but before all somebody! 
whose qualification is to be acknowledged with the retri- 
eval system of the center. Between the searcher and the in- 
formation scientist of the center, communication can be 
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nothing other than an uncomfortable one The consoqncncc 
may l)e - and often it is so -an unsuitable answer, an 
answer that gives information but not what the hrench 
word «rensciiTnement» means. A «renseionement» is disco- 
vered by the user himself and not by a too precise answer 
given bv information centers. -The situation will be dif e- 
rent if a generic system of «necd groups» categories allo- 
wed the searcher to define his needs not more in spccia i- 
sed words, but inside a general frame. By questioning witn 
a «need group» category, the answer will have less chance 
to escape the «renseignement». Need group ^;i''s^vci-s give 
information about and around the need and the ability 
of choosing remains the one of the searcher hnnse f: he 
opens lhe dialogue with the most suitable papers The ge- 
neric approach respects the personal behaviour of the sear- 
cher while the transit through the information scieniist of 
the center introduces a break in the communication. 

SECOND To-day the bulk of the published papeis is 
so heavy that the center loses any ability to make an 
exhaustive choice or treatment. Transit" by '^l^stract jour^^ 
is comnulsory. But abstract journals are redundant, 
and l^heiTSs'are built under rules of non-compatib e 
retrieval systems. To compare the abstracts given by dille- 
rent journals and to make the best selection ^^r more 
suitable answers may be -and is indeed - a ttring puz- 
zle. The situation will become quite easier xyith standar^ 
dized generic approach and bibliographica ^^^Idresses. T^^^^ 
abstrad journals will be sold and updated on miuocar Is 
while the generic indexation will be received on tape lo 
the c6mputer. The generic approach will not only respec^ 
the behaviour of the searcher, but the who e ^ol'eclio -o 
abstract journals at disposal becomes easy to be consu^^^^^^^^ 
by a single retrieval request. More, the dr ferent abstracts 
published for one paper will be identified by the computer 
thanks to the bibliographical address, and one niay c oos. 
the best ones. Standardized addresses organize implicitly 
the communication between abstract journals. 
~~ THIRD To discover the nearest deposit library lor 
articles which are recommended by computer re rieval 
may be slow or impracticable. We lack these listings 
which give relations between bibliographical address and 
addrosfes of depository. By 

computer might give answers by standard bibhogi aphi- 



Cell addresses, and lisLs may be used as input for ano- 
ther programme which will give the concerned addresses 
01 deposit. Papers printed as answer by the computer 
might be used as orvler sent to a microfiim iibrar>. Staii- 
dcjrdized addresses organize implicitly the coupling bet- 
ween the computer and the microfilm library whicir sells 
hard copies or duplication of microfilms. 
— FOL'RTH. Priyate matriculation numbers are used 
by any information centers tor internal communication. 
Tliey permit control of the communication between the 
document, the abstract, the indexing and the input on 
equipment (computer or others). But they are of no use 
outside the center. More, such a malrieiilation system is 
■not at all a bibliographical address and remains of no 
help to preserye against making seyera! inde.v:ings and 
inputs for the same document. — Things will be quite 
different with a standard bibliographical address which 
may be used for retricyal by computer. Then a one-ano- 
ther-help between centers will become faster. Detection 
of duplicate indexing will be easy by computer. The 
habit of priyate matriculation numbers will become use- 
less, and information centers will be able to file their 
abstracts in the alphanumerical order of the bibliogra- 
phical address, the same order for any center. That last 
possibility will be, as we may think, the powerful tool 
of communication between centers. In J965 already 14], 
the author detailed the numerous advantages of the bib- 
h"ographical address given by care of the editor. 

Without a sane standard ization of addresses, informa- 
tion will not easily reach the stage of a trade or of an 
industry. We do not know any other structure that can 
make communication faster, safer and more efficient. 
Thanks to agreements on communication, which means 
some restriction of autonomy, efficiency may become 
very large. The proposed structures may open 
the doors to new procedures for progress. One 
of tile more efficient will probably he the impLilse given 
to the literature of rewriting, of surveying and of state- 
of-the-art reports. Progressively the mission of the spe- 
cialized information center, which is lo set up a «da- 
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la biise», will extend till tlie «clala ])ank» mission. AnoLlier 
consequence will be, for any company, thai the industri- 
al information center will i)e relieved of a part of the 
burden of updating the collections, and will become more 
able to fulfil its real role in the company: the drawing 
up of surveys which bring an eliicient introduction to the 
decision. We may pronosticate that communication of in- 
formation, thanivs to these new structures, will change 
in size another time, allowing from that moment a more 
rational use of computers and equipn.ent. ■ 

Information will become «moving» and will find bet- 
ter opportunities to contact qualified and cognizant pe- 
ople, and to fructify. One will be more able to dissemi- 
nate the information and to use it in the outline of the 
definition given by Francois Perroux 181: 

«L'information est un bien de connaissance qui s'aclie- 
te el se vend, qui est produit et distribue, qui comporte un 
coLit et procure un rendement, ml est astreint a des reg- 
ies economiques en quelque orptnisation que ce soit» 

(Information is a good of knowledge which 
one buys and sells,, whicb is produced and 
dealt which requires a cost and raises an 
output, which is tici to economical rules in 
any organization wbalever it may be) 
We may think that a better standardization of com- 
munication thanks to addresses of general acceptance 
will ease the development of a my)re harmonious network 
of specialised information centers, with the objective of 
«reviewing» knowledge, whicb is the most significant 
proposal of the Weinberg Panel report 191. Such revie- 
wing of a field of knowlecige opens the door to the more 
suitable bibliographical references: «If you like to dig 
deeper see so and so» flOL 

These information studies are expensive, the more so 
because they have to be draum up by very qualified pe- 
ople. The only possibility to lessen the relative cost of 
such reviewing studies is to sell them to more customers. 
That is certainly an important justification for better 
communication. Information reviewing will become a bu- 
siness success as mucli as a procedure for progress and 
education. Trade of information claims for its place aside 
trade of books and periodicals. 
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6. Awareness by each of us 



Information of knowledge e\'en to-day is probably tlie 
single liuman activity «of commimication» which does 
not succeed in leaving tlie artisanal stage for the one 
-of an industry or a trade. Information activities are un- 
able to rise above the encumbrance level, for they are 
unable to amend by their own. 

It is likely that the .most important cause is a poor 
understanding of conditions of efficiency for computer 
and other modern equipment. 

It is not seldom but however surprising, to meet pe- 
ople confusing information itself with the document that 
is nothing else than the holder of information, or people 
confusing the «mcssage» and its «transmission». Such 
confusion, enclosed in our habits with the big role of 
libraries in the information field, carries its consequences: 
One had inclination., and one has still inclination, to 
entrust information treatrnent to those who are tradi- 
tioJially responsible of documents xadth the effect to .give 
to information the ccniiralized struiccture whicli is the one 
convenient for documents treatment. Slowly but surely 
awareness stretches around the world that information tre- 
atment requires a close contact witfi specialists of the 
concerned field of knowledge, that r.^itians an unavoidable 
decentralization towards these specBiiists. From that mo- 
ment the dynamic part oF the treatoaejit cannot come from, 
the document itself, but from the huge specialists net- 
work disseminated in aey active pEace for progress, for ■ 
practical application, or for research. One understands 
also that the bidder of information can in dissimilar 
circumstances become the creator or the seller of infor- 
mation. Everything is moving inside a network where 
dynamism cannot be kept up unless communications of ; 
a high standard exist. We must recognize that such dy- ? 
namism which is a result of communication, is really : 
running in the opposite direction with information treat- j 
ments centralized in more and more powerful knots far ; 
away from the specialist worker. Such duality of outlines : 
may imperil the result. 

As consequence of the above considerations, the so- - 
lution appears more clearly: it lays on a rational organi- ; 
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zalion oi communication nnd on standarchzcd addresses 
whieii will enlarge the use of computers and equipnicni. 

It is a reform of the structure. Perhaps some respon- 
siblcs presently in charge at a place o the \n ormat.orj 
chain will put the brakes if they do not understand that 
the best is to go with progress. 

The mission of communication appears as a main 
mission for the community. It ought to be ruled between 
eovernments by taskforces or delegations responsible for 
the solution to be chosen, but doing so in tl^ o.uilline 
of future and not in the frame of present obsolete struc- 

■^'^^With regard to the position of -each of us, the best 
we can individually take is to decide in avour of the 
dynamic conception of communication. Each of us mus 

^be aware that the reform will hai^-e a heavy ^J^^P^ ^] 
the future, and one can be warrant to have its erlort^ 
to adapt himself, rewarded by the result. 

It should be desirable that a group of people wuo 
are well sensibilized to that problem of structure wi 1 
set up a design convenient as starting point for an in- 
ternational move via the national governments. 
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D. B. Baker (USy\), P. V. Parkins (USA) 

and J. Poyen (France) 

THE FUTURE OF ACCESS 
(ABSTRACTING AND INDEXING) SERVICES 



«Thcr£. is a growing realization thai man's 
Tutpre ictay be litcraiilywhal te chooses to ma- 
ke it, and that the rang'es of choice and the de- 
gree of /onscious control which he may excr- : 
cisc ill dderminingr liis f.iitiirc arc iinpreccden- 
tetl.» John Mclhile 

I- History 

Over the centuries sdncriars and research workers, as. 
well as editors and iiijrairizsins, have recognized the great 
importance of bibliographic control of the literature via 
indexes, catalogues, lexicons, registers, abstracts, reviews, 
digests, and the like. The main purpose of these tools has 
■ always been to record, to organize, io alerl, to retrieve,- 
and in modern terms — to access — tlie information and 
knowledge of the universe. 

Indexes have been relatively easy to make and were: 

one of the first locating devices to be started by classical! 

and Biblical authorities many centuries ago. The first vo-l 

lume (1545) of Conrad Gesner's enormous «Bibliotheca j 

universalis^ was more of a classification system, a comp-j 

rehensive international dictionary, than an index, but hisl 

second and third volumes, «Pandectae» and «Partitiones» ; 

were subject indexes. Indexes were well established when? 

the first periodicals appeared on the scene. It was a logi-j 

cal and inevitable extension of the use of indexes to apply ] 

them to the first journal. The first volume of the «Philo-i 

sophical Thansactions», Royal Society of London (1666),! 

cnin*^^^ issued with a subject index. i 
hKJL 
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Tlie rr ;c\v journals in llie Inie 1700s mny be regarded 
as t!ie nricrmediate between Ibc eollective periodical inde- 
xes and the abstract journals. In 1771, in the review o\ 
«MediciMische — Chirurgische- Bibliothek» in Gernuiny, it 
is stated, «a nun-.ber of publications 1 have myself seen 
and £ibsi:racted. Most of the abstracts, however, are from 
the best journnls, and frequently literally transcribed, even 
when I had the book in hand and had read them through 
since this is in accordance with my plan.» While it is 
foolhardy to .claim the title of «first» for any bibliographic 
event, certainly the «Pharmaceutisches Zentralblatt» 
(1830) a forerunner of «Cheinlsches 2entralblatl», was one 
of the earliest abstracting and indexing journals as we 
know them today. 

In the twentieth century, abstracting and indexing ser- 
vices increased rapidly — som^^ broad and some narrow in 
the subject scope and in the range of journals covered. 
By 1930, most disciplines in science and technology and 
in the more technically advanced nations had some se- 
condary services. There were numerous studies between 
1930 and 1960 which report the extensiveness of the se- 
condary services in coverage of the periodical literature. 
It was not until 1963 that «A Comprehensive Guide to the 
World's Abstracting and Indexing Services in Science and 
Technology» was published by the U. S. A. National_ Fe- 
deration of Science Abstracting and Indexing Services^. 
This guide classifies and records the details of some 1855 
secondary services in 40 countries. A FID Guide (second 
edition, 1969) of abstracting services alone identifies 1300 
world-wide in Science and Technology (Vol. 1) and 200 
in Social Sciences and Humanities (Vol. 2). A joint revi- 
sion of the NFSAIS and FID guides is proposed for 
jg73_4 and estimates show an increase of secondary ser- 
vices to be 1.5% per year. 

ITowever in 1947, the United Nations Educational and 
Scientific Organization (UNESCO) undertook a program 
«to facilitate the improvement of scientific documentation 
and abstracting* in • accordance with its charter «to ma- 
intain, increase and diffuse knowledge — by initiating me- 
thods of international cooperation calculated to give the 
peoples of all countries 'access' to the printed and publi- 
shed materials produced by any of them.» One of the first 
international conferences on contemporary scientilic m- 
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fiormalion wns held by the il? yal S-ocicly iiT, [.(UkU^h in .lu- 
h;:, 19^8, wlicn two setr'noH.^- ; [ niicetings were licid on im- 
rr^nenients in publishiinig c ud dastributing of abstracting 
m indexing services. Resn 'ling from the *lJNESCO prog- 
T:mn and the Royal S(H'ieiy Cor iiereiice, an Abstracting Bo- 
anl was formed miidier tlie an s]:) ices of the International 
Coimcil of Scientific Unions (ICSU AB) in 1952. The pur- 
pone of ICSU AB at the beginning and continuing today, 
:is «to organize and promote tiie international exchange 
and publication of information in sciece and technology. » 
ICSU AB collaborates with other international organiza- 
tions with related programs, for example, in regard to the 
i:n.teraction between scientiifk disciplines and the use of 
modern methods of infoiniiiationrrprocessing: UNESCO, FID, 
IFLA, IFIPS, Internalioiml ©rganization for Standardi- 
zatiiion «ISO), and ICSU Coiinmittee on Data for Science 
and Technology (CODATiJj).. 

2. Curitent Status 

2a. Growth of Maierial 

In «Little Science, Big Science», published in 1'963 
by Derek de Sol la Price, he points out that the growth ra- 
te of printed abstracts lias stabilized at a level which ef- 
fects a cumulative doubling in each 15 years. This genera- 
lization can be disputed in particular disciplines, but it 
is indicative of a fairly constant growth rate over a long 
period of time. And if true, it means that between now and 
1986 as much material will be published as has so far ap- 
peared in print during the whole of scientific publishing. 

Although Professor Price was discussing the growth 
of published abstracts, this of course reflects a rate of 
growth of similar dimension in the primary literature. In- 
deed some discipline-based access services are aware that 
because of economic and other constraints, they have not 
always been able to keep pace with coverage of the prima- 
ry literature to the extent that they would have wished. 

lhr6ughout at least the last 25 years there have been 
regular predictions that the growth rate of the primary 
literature must decline, and that if this does not happen 
naturally, then some artificial restraint at the source 
would need to be applied. To date this has not happened, 
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"cilhcr iiaturriily or by artificial] Jiicnns, and tkc ■ . 
services would be poorly advised to base Uicix ir. 'n 
ning on such an assumption. 

2b. Improvemen! iti Tooh 

The massive volume of printed material •.yv -.miiixiQd 
problems not only for the processors of secanjai:;:: . imor- 
jnation but also for the users. Conventional; iiEEsiii xm of 
production of abstracts have been stretched to fe .L--ii!ti. as 
services began to handle a quarter of a miliio]:]:iTi?rT::iiv per 
year or more. Equally strained are the infornTaKiojirr .-.arch 
methods of users, for no scientist can hope Icr • - ace 
with all of the literature in his full discipline a:-:^ . r^-ed, 
he may face very serious problems in keeping 
literature in his own narrow field of specializaifitSn 

Fortunately, the becoming available of elec;r CKwm- 
puters has in this, as in so^nany other instant i,reen 
helping both tJie producer and the user. It is wr;?[ , in be- 
lieve that the availability of this modern tool pr )\ i- s the 
solution to all problems, but computers are ceit-n; - ma- 
king much possible whicli cc)ul(l not otlierwise liavi'. h.tHM^ do- 
ne. Firstly, it offers the opportunity for massive stcn-ii^^-e of in- 
formation in relatively small areas. It also proviiUes an 
efficient and rapid searching tool which means that retro- 
: spective searches of the literature which would have requi- 
red weeks or months by hand can now be accomplished 
.: within a few minutes. However, it does not as yet function 
; as the intellectual interface between the questioner arfiine 
■ system; the conversion of a question into language smtab- 
^ le for searching the computer file is still a very smial 
; job. Some experiments on direct retrieval by users lia:ve 
V been successful but these have so far been limited to re- 
latively small files. The ability of every scientist of search 
; a massive file is part of the planning of most large sys- 
tems; it is not yet accomplished routinely by any meaOiS. It 
IS one of the misfortunes of scientific language thaS sci- 
entists do not always use the same words to mean the 
same thing; otherwise the retrieval of information firriHiii a 
machine-based file would be a much simpler matter- 
Computers also offer facilities for selection from aiTnain 
file which can be searched in any way needed and pirodu- 
ced as magnetic tape output for driving photocamipasing 
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machines which in T'.irn can typeset the published material 
much more quici<ly and efficiently tiian conventional prin- 
ting machines. Tliey also enable current awareness sear- 
clies to be conducted on material shortly after it enters the 
file. This means that a scientist having a specific problem 
may be constantly alerted to anything new published in his 
field. Nothing in his speciality may be newly published and 
if so he will know this also; and no matter how infrequent 
publication in his interest area may be, he can be assured 
of its being brought to his attention. 

All of the foregoing which at the moment seems so ad- 
vanced and ambitious will doubtless in a few years scr?3n 
quite primitive. New methods of recording scientific in- 
formation different from the conventional ones are being 
developed and tested, and will be adopted. Methods aimed 
at machine-based services will be used and many of the 
problems will disappear. It is the almost impossible task 
of attempting to convert a system which has employed the 
same techniques for centuries to one using modern tools 
which introduces most of the difficulties. Complete re- 
think and redesign will be necessary. 

In addition to the application of computer communica- 
tion techniques, changing facilities for the rapid transfer 
of material from one place in the world to another, e. g., 
by communication satellite means that national barriers 
in information transfer can be broken down and opportu- 
nities for true international cooperation become more rea- 
dily available. 

The different languages still present a problem not on- 
ly of translation but of understanding what the same words 
mean to different people. An developments toward a lan- 
guage of science would be of tremendous benefit in . the 
transfer of scientific information. 

3. Future 

Sa. Relationship between Primary and Secondary 

Publications 

For the purpose of this discussion the information pro- 
cessor is defined as an organization that provides prima- 
ry and/or secondary publications and services. The prima- 
ry publications are the true repository and record of the 
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original, basic data and information in discrete, printed 
packajres. Secondary services provide iir<i;-aiijxed, subject- 
oriented access to. newly available ini"orination from a 
specified range of primary publications — serials, patents, 
and reports — and/or to tbe accumulated store of infor- 
mation available in sucli documents. Secondary services 
do not replace the primary record; abstracts and indexes 
are not surrogates for the primary documents. While so- 
me primary publications carry in each issue a section, li- 
mited in scope, of abstracts from other primary journals, 
the lar<ye, major secondary services liave developed as se- 
parate functions witii separate operations in the past two 
centuries, with very little coordination between these pri- 
mary and secondary processors. 

In the traditional publishing system, each information 
processor has independently established its scope of co- 
verage, editorial practices, organization of data, timeli- 
ness of reporting and publication formats. Cooperation 
among primary and secondary processors has been seve- 
rely limited in the traditional system because of the manu- 
al transcription processes which are required to transfer 
information from one operation to another and which ha- 
ve prohibited such cooperation from being economically 
feasible. 

In the developing computer-based information-handling 
systems, however, the functions and component parts of 
the processors must be increasingly automated and coor- 
dinated. Such automation requires fully defined self-con- 
sistency within each processor's operations and highly pre- 
cise recording of the information handled within each com- 
ponent. Self-consistency does not necessarily imply identi- 
cal practices for all processors. However, exchange of 
machine-language records between two or more processors 
does require that each processor operate at the same level 
of detail and that each operation follow its own defined 
practices consistently. 

Many experiments have been undertaken by the proces- 
sors throughout the past decade to further compact the 
information, shorten the time in process, decrease duplica- 
tion of intellectual effort, reduce costs, increase the num- 
ber of papers or abstracts handled, increase utility of in- 
dexing, and increase reliability of the recorded information. 
In addition, a majority of primary publishers are now pro- 
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viding al,)stracis along with the full paper. Most arc using' 
more infornialixc titles tlum in the past. Some arc using 
codes to identify uniquely tlieir journals as well as each 
paper witiiin tlie journal. Further, some are experimenting 
with the generation of approjiriate index terms, diagrams, 
fornudas, and nojnenclature to be used jointly with secon- 
dary services. 

The growing practice among primary publishers of 
storing part of the autlior's data in a depository in lieu of 
pLdjlishing it presents no serious problem to users so long 
as the deposited material is made available to the secon- 
dary services and is widely and easily accessible to lib- 
raries and individual users. In recent years, some primary 
pid)lishers also have started single-article announcement 
services whicli permit users to order relevant articles on 
the basis of titles and/or concise summa'^'.s. 

Hopefully, the experiments and plans being made today 
by information processors will lead to sound cooperative 
systems of the future. These plans are designed to unco- 
ver and solve the ])roblems of primary and secondary pro- 
cessors working together in tlic development of a total in- 
formation processing system. As the literature has grown 
and as editorial and production costs have esv'alated, it 
has become mandatory to eliminate the duplication of in- 
tellectual effort which goes into the analysis, editorial, pro- 
cessing, and indexing of every scientific document — steps 
at present largely duplicated by primary and secondary ; 
processors. 

The experience gained to date has enabled primary and 
secondary services to design and start building more ef- 1 
ficient systems to interlink the processing steps. It seems | 
likely that such interlinkage of operations will be further : 
extended and broadened in the decade of the seventies both ;■ 
on a national and international basis. 

In the meantime, some other steps can and will be un- ! 
dertaken as the computer-graphic processing equipment 
and computer-based information-handling procedures are 
installed by more and more information processors. It is : 
being suggested more each year, as the pressures grow, i 
that the full primary record of the data and information 
can be deposited in microform, and perhaps even in the 
digital form eventually, througliout the world in the large ^ 
central libraries. The alerting and retrieval access func- • 
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liens by soino combined primary niitl sccoiulary services 
will perl Kips he well along by 1980. These processes will 
necessarily be well coordinated with aclivities of the in- 
formation ceiiters tliat cm'rently are being established 
worldwide lor c\-ahialing or selectively disseminating in- 
formation as well as with the libraries whicli provide the 
original dociiniont handling and back-up services for the 
full record. 

Tliesc trends and advancements will certair.ly offer the 
nsers the following advantciges: increased reliability, imp- 
roved access, more timely awareness, improved res[)onsivc- 
ncss, and reduced costs and better o\'erall economic per- 
formance of the system. 

3b. New Trends for Secondart/ Services 

Tlie power and flexibility of the new ijiformation sto- 
rage, retrieval, and dissemination tecliniqiies that bave 
been developed or now are being developed are having a 
tremendous effect on abstracting and indexing methodolo- 
gy. The major secondary services are undergoing rapid 
cha' :,. and accpiiring new, more extensive information- 
proc. ■ ing and disseniinnating ca jiabi litics. The use of the 
compuler and grapln'c devices to repackage information 
has i^roven to be a most important dev-ilopment of the 
1960's in its effect on the ability of the abstracting and in- 
dexing services to l^etter meet users' requirements. The 
variety of possi1)lc forms of output from these systems is 
virtua 1 ly unliniited. 

In these computer-based systems, information selected 
in a single intellectual analysis of the source documents, 
an analysis which combines both abstracting and inde- 
xing, is put into a unified maciiine-manipulatable store 
through a single keyboarding operation. Then from the 
unified bank of information, material appropriate for 
special-subject alerting and retrieval can be drawn, lar- 
gely by computer program. 

The large, conii)reheiisive, printed issues and volumes 
of abstracts and indexes continue at pre- ^nt to be the most 
used — and, therefore, best supported — products of the 
secondary services. However, the distribution and use of 
microform and magnetic tape versions of die comprehen- 
si\'e volumes as well as selected subsets of data are incre- 
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asmg cacli year. Some secondary services, in addition, arc 
attempting to provide specialized bibliograpiiic and de- 
mand searches directly to tlie user communitv on a broad 
basis Other secondary information processors are provi- 
clmg their data bases in a gross or semi-refined form to 
miormaLion dissemination and/or cvahiation centers which 
turther refme the information, often combining it with 
oiner_ data bases, and provide selective dissemination to 
individLia! users or a particular community of users The 
numher ol these information centers with computer pro- 
cessing capability is increasing each year. Over 100 such 
centers were in operation in the United States alone by 
the end of 1970. Many nations have established or are in 
iiie process of establishing one or more such information 
centers to serve the various scientific and technical disci- 
plines; he number of centers depends upon the size and 
needs of the nation. In some instances, several nations are 
cooperating m the support of a single center. 

It is too early to predict the exact role of the seconda- 
ry intorination processors in the futui e — whether they 
wil! continue to provide a full range of publications and 
services directly to the information users or whether a 
substantial portion of their services will be provided 
through mtennediaries such as information centers. The 
options are many, but coordination at all levels will be- 
come increasingly important as the use of computing s*'- 
stems grows. i t. <• 

3c. Future of Publicaiions 

Let us start by saying that for the foreseeable futu- 
re, some form of the printed media is here to stay. It is 
hard to conceive, so long as the human race continues to 
be so mobile, that anything can replace the portable prin- 
ted medium. The facsimile delivered to the home, the of- 
fice, or the laboratory is only a partial substitute, though 
It will undoubtedly become a reality. 

There will be an increasing opportunity io have elect- 
ronic or graphic presentations of information in the home 
or office via terminals, supplemented by printed backgro- 
und material in hard-copy form. The major obstacles to 
this form of communication today are the .ligh cost and 
the lack of satisfactory terminal and electro-mechanicql 
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oulpul equipmeiil. U is doubtful lliat the kcliniciil prob- 
lems will be overcome and he costs reduced to a reasonab- 
le level before the end of this decade. 

In the future, people will read more because they will 
have more time to read and they will need to learn jnorc, 
although their reading will probably be in a context comp- 
letely different from today. These reading patterns are al- 
ready changing, with the integration of electronic commu- 
nication and printed and graphic material witli various 
audio visual presentations. 

Undoubtedly, these changes in reading patterns, and 
a trade-off between the printed pubi'ications and the 
electronic communications, will continue. The. informati- 
on processors will need to keep close to research and 
changes in reading habits of the users to adapt the con- 
tent and form of their output. 

3d. Future of Tools 
3dl. Hardware 

As used here, hardware is defined as mechanisms of 
all types used as tools in «access service» processing. In- 
cluded are computers, photocomposers, and input devices, 
as well as the software necessary to control and operate 
these hardware devices. There is no question that, elect- 
ronically, hardware is getting more powerful, faster, che- 
aper, and smaller, much as has been projec* for many 
years. All recent technology forecasts contii-.ie to claim 
these trends, and the advertisements of new equipment 
are consistent with the forecasts. 

The ability of routinely available input and output 
hardware to handle letters, numerals and other typograp- 
hic characters now has expanded from the early 48 to 
64-character sets (upper case only) to the 95 characters 
(including upper and lower case) of the American Stan- 
dard Code for Information Interchange (ASCII). This 
does not yet satisfy the full range of character require- 
ments for many access services, which are surmounting 
the problem through specially designed printer and pho- 
tocomposition units. While the basic electronics capabili- 
ty for expanded character sets exists in various cathode- 
ray-tube devices and photocomposers, a mass market for 
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such devices is needed io encouraov .-iddiiioiKil develop- 
ment and rensonahle cusls. 

or iKirliciiii]!' concern to iiiiormntion processors is tiio 
development of computing eqnlpmenl niejnory wiUi res- 
pect iioth to size and cost, init the capat:)ility is greatly 
improving eacii year. A more difficult task is ogranizino- 
the information to l)e stored and providing suitable su|)- 
porting software. TJiere is much work being done in this 
area now. This is a key problem facing all access servi- 
ces in Ihis decade. 

Input mechanisms are developing verv well. Graphic 
input devices are in their early de\-eiopment stages. Voice 
input devices will not be technologica li v or economically 
leasible for large-volume input in the "near future. The 
most important advance for access services in the short 
range, future will be to capture as much of the input as 
possible as ])art of the primary publication process. 

Microform production and handling mechanisms com- 
prise another important form of hardware for the infor- 
mation processor. The role of this equipment in informa- 
tion handling, whether as processing aid or distribution 
medium, is still not major. However, as judged from the 
latest conferences, the variety, capabilities, and prices of 
microform equipment arc improving very rapidly. 

3d2. Auto m a t i c I n d e .\ i n g a n d Abstract i n g 

Text ])roccssing by machines to provide functions equ- 
ivalent to abstracting, indexing and other forms of lin- 
guistic analysis has long been a subject of research. On- 
ly a little over a decade, however, have serious efforts be- 
en directed to the use of machines for «automatic. inde- 
xing». At this time, «automatic indexing» in the form of 
KWIC and coordinate indexing is widely used. These in- 
dexes provide users with enough access points to the li- 
terature to be somewhat acceptable, and at the same time 
they are economical to produce. However, these methods 
do not meet the criteria of excellence and completeness 
or compare favorably with some index methods currently 
carried out by hunjans. Thus, the problem is how to im- 
prove the relationship between performance and econo- 
mic efficiency so tliat one can develop computer-based 
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in(le.xinf( systems with the dcfjrec of sophislication nee- 
ded to satisfy user recjuirenients. 

The dilfieulty of the process of juitonuiticfilly seleelinrj 
index headings' nnd the problem of transformino- sourec 
index- material to a format suitable for modilieation under 
these headings have delayed the use of automaticinde- 
xing. Present indications " are that computer-assisted 
indexing techniques may be more feasible than complete 
automatic indexing directly from original documents.- It 
seems natural that if we let the human analyst retain tiic 
task of selecting meaningful jheadings from titles and 
sentences suitable for computer processing, the tedious, 
time-consuming, and difficult procedures of articulation, 
permutation, and the like could be performed by the com- 
puter. 

The preparation of an abstract from a full report can 
he a difficult process, especially if one demands from the 
reviewer a critical resume or an evaluation of the repor- 
ted work. Oi. the other hand, preparing indicative infor- 
mation on what an original document is about is a much 
simpler operation which can in many cases satisfy the 
needs of the user for information. It has been demonstra- 
ted that machines, with the Iielp of a few rules of seman- 
tic and linguistic origin, can extract sentences from ori- 
ginals just as humans do, although the indicative al)st- 
racts produced by extracting key sentences should proba- 
bly be called «extracts» rather than abstracts. The algo- 
rithms developed to extract these phrases and sentences 
from documents are based on approximations of human 
procedure^ involved in the process, for instance, of un- 
derlining some important passages from a book. 

Such computer-produced extracts have the same or- 
ganization and vocabuhry as the original, and for the 
extraction process to be effective, the original documents 
need to be written in an expository style and have a lo- 
gical organization. These arc generally prerequisites for 
publication of scientific papers, which makes automatic 
extracting widely applicable with little, if any, pre-cdit- 
'iig. 

The lack of ready availability of primary aocuments 
in machine-manipulatable form currently limits the appli- 
cation of automated extracting methods, but with increa- 
,sc(l use of computerized primary publication processes. 
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• tlic input ni<']leriti] will become aviiiliiblc. lack of inter- 
sentence coherence is another problem of automatic ex- 
tracting. This could be solved by human editing; the pro- 
cess of producing extracts with the aid of a computer do- 
es not need to be completely automated. Finally, seman- 
tic and syntactic criteria for algorithmic extraction still 
need some nnprovement so that more reliable extracts 
with smoother transitions can be produced. 

Thus, the present state of the art indicates that auto- 
matic or semi-automatic indicative extracting of scienti- 
fic articles, may be possible in the near fufure. It will 
come at fn-st with the goal of reducing, bv mechanical 
means, the size of textual materials so Ihat^readers may 
be advised without much effort whether they should read 
the entire original document or not. 

On the other hand, informative and critical abstracts 
will_ continue to demand sophisticated semantic and syn- 
tactic rules for content analysis and deep structure un- 
derstanding, all of which are highly intellectual orocesses 
quite difficult to approximate by machines. Such a pro- 
cess IS as complex and difficult as automatic, natural 
language translation and speech recognition, which are 
still at an academic research stage. Many problems re- 
main unsolved, including the need for further development 
in linguistic theory significant to automated language 
processing, before such processes can be considered by 
non-academic institutions. 



3d3. On - Line Users 

On-line access to information, which persists as a 
dream of most information users, is today a reality only 
for a favored few and for a limited scope of material 
It IS, however, a correct vision of the future for many 
with the only unresolved question being «when?». The ti- 
ming will depend upon many factors. Some users cur- 
rently have access to a major computer system that can 
be used for on-line information activities on small fi- 
es, and this capability can be extended to large files as 
large capacity direct-access memories become less expen- 
sive. However, for most users, on-line searching will 
depend upon the development of cheap, high-capacity 
communications and terminal facilities to provide access 
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to filc^: mainlained remotely. These arc technically feasi- 
ble at this time, nuiclr as space travel was scientifically 
feasible in the early I960's, but there must be a sufficient 
incentive to justify the development costs. 

As important as the world scientific and technical in- 
formation complex is, it does not offer a large potential 
market. or reveniie potential as compared with that for te- 
lephone communication or television transmission. Thus, 
wliiie the technological problems that must l)e solved to 
make wi'.espread use of (jn-line systems a reality are 
no more diflicult than those that have been solved in 
some other area of electronic communications in recent 
years, the cost of solving them must .be borne by a much 
narrower base of support, and the social and economic 
incentives for solving them do not seem to be sufficient- 
ly strong at this time. Moreover, not everyone has the 
aptitude or training necessary to make effective use of 
on-line systems. If these systems can be inexpensive 
enough, this probably is of no particular consequence, but 
if a large expense must be justified, it will be necessary 
to show a significant improvement over other methods of 
access. The on-line retrieval system is really no more 
than a vehicle that the user pilots through information 
space, and the effectiveness of the results depends upon 
the pilot's skill as much as the inherent capability and 
organization of the system and the files within it. There 
is a basic difference of approach between the «continu- 
ous-absorption-to-be-prepared-for-anything» method of 
using information, which is traditional current awareness, 
and the «request-what-is-needed-on-demand-and-don't-wor- 
ry-about-irrelevanctes» method. The difference is the stra- 
tegy of use interactive systems. 

3d4. D i g i t a 1 a n d Satellite Communication 

At some point in the future (as yet an unpredictable 
time) digital communication facilities, including satelli- 
tes, will become an electronic marketplace for informa- 
tion and computation services. Conceivably, such could 
provide the «transportation system» which enables geo- 
graphically remote specialized services to be brought to 
the information consumer. When such a situation deve- 
lops, the «acce5s services» will find it desirable to enter 
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{'hat iiiJirkelplacc, ciflicr dirocily, of indirectly tiirousii in- 
fornicilion fi'*;{ribulioi'i centos. 

One can imni]jinc this type of fnlurc and [ can be 
c'^'si^rned. However, il is not yd time for tiie access 
services to take any special steps in that direction. The- 
re is cnonti'h to do now in rontinizing- and restrncturing 
, resent proco: >cs oiiJ services, ll this restructuring is 
well done, it will be applicable for extension to a com- 
munications-oriented world, if and wlien that world deve- 
lops. If it does develop, it will not be uniform worldwide 
because of the uneven state of technolof^-y advance in 
various countries. Thei'efore, services with worldwide in- 
terests will have to operate in a mixed marketplace. 

4. New Trends in Coverag-e 

Traditionally, secondary information services have de- 
veloped by discipline and by language. In some count- 
ries such as I--rance and the U.S.S.R virtually all fields 
of science and technology are encompassed by one cent- 
ralized information service; but in others such as Ger- 
many, the U. K. and the U.S.A., separate services, usual- 
ly based on professional societies of scientists have been 
created as the demands for them have arisen. 

This means that, for example, there arc separate ser- 
vices in physics in English, French, German and Rus- 
sian languages, with extensive duplication of coverage. 
Ailhough this duplication in coverage may in some cases 
be desirable, the procuring and handling of the same 
material many times in as many different places may be 
wasteful. 

_ Another more recent development has been that of 
mission-oriented access services, devoted to the sup- 
port of such fields as atomic energy and oceanography, 
where information services are required which span two 
or more disciplines as necessary ,to cover a particular 
|)roject. These services may be national in scope or in- 
ternational. «Nuclear Science .y\bstracts» offers an ap- 
propriate example of the former; in the international 
field, INIS is a new and good illustration. Many of to- 
day's problems, because of their breadth of interest, are 
interdisciplinary in nature. Thus in the future we can 
expect greater emphasis on muUidiscipIinary svstvms. 
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11 is fnri',r-ilc now (here is a relalivuly new and 
<^ru\vin<r rcfo^iiilion lluil, in proper coniiMiintion, niain 
of the' Ion"- -.landin^^ iliscipiine-based servicos arc ca- 
l)able of providing the subject coverage needed to esta- 
blish a mission-oriented service. For whatever reason, 
in the past we have witnessed a tendency to create anew, 
to bCfXin all over again without taking advantage ol in- 
formation sources already available, 'lo some extern the 
discipline-based services are tliemselves to blame, oi 
they- have tended to pay insufficient attention to develo- 
ping compatibilitv among t!ic services, and the probtem.s 
of converting this material so that it can be used m one 
file have been tremendous. Now, however, (liscipline- 
based services are recognizing their responsibilities, are 
cooperating much more actively, and joining together in 
many aspects of their future planning. One can suiee 
without exaggeration that even now, it should be possib- 
le to meet most future requirement for mission-orienied 
access sc'vices from a combination of selections iron 
discipline-based services.. Plans should be laid to uiai 
end. 

It is difficult to predict wdiat mtiltidisciplinary servi- 
ces may be required. !t should not be necessary to j)rc- 
dict however, since the coverage of all scientific lields 
is reasonably comprehensive. With relatively slight mo- 
difications and additions, with wdiere advisable some 
chanp-cs in emphasis, it should be possible to meet de- 
mands as they arise. Considering a current example 
research interests in space exploration have m large pait 
shifted to ecology, the latter loosely defined in a variety 
of ways. Although newly highlighted, ecology is not new- 
ly discovered. And preserved in the files o s^yera discip- 
line-based services is a massive store of vital miorma- 
tion for those interested in this field. Furthermore, the 
machinery already exists and is operating to capture all 
scientific writings of ecologic importance, and to analyze 
and store these appropriately for future retrieval. Such 
«stores» can be quickly tapped to meet current deman^^^^ 
It is in this direction that the development of mission- 
oriented services must move, if it is to be possible to 
bring adequate information promptly to bear on the 
w^orld-s complex changing, multidisciplinary problems. 
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5. Cooperation 

5a. Present Silualion mid Needs [or Cooporaiion 

Before looking at tlie fulure of access services from IHe 
coopcalion point of view, let us look at the p esent si 
alio,, and identify some of the needs for cooperation 

lias lieen-L'l-n °' ^'™'^'y literature 
ries TuK ?f ifn ^'".^•''1'°"'=""^' ^.te for several.cen- 
.1 re ,K ' , ''""^ expensive lo 

snr's 1 ' <''=^^™iinate this information 

e niis.r ^'-'V "f knoNvlcdfje, who- 

0 Ze o ''''''"'■':"'?"'f,''- inneasein 

s dd ti, n. 'mT''"7""^ '," being rendered 

L ie d n nf '1^^,.'^'^™'^' '° ™°st specialists in 

manaJers r, P^'ici'lar to most 

a e fhl^ri nf'V'i-''"^' be able to 

st-md rrk If ^ " '1''''= '° "I'-'iit^'in high 

don^con J'l',! ;|ie 'Alternatives? One would be to aban- 

lant Sr"^ / ie''?' ^'"7 T'^'^e more or less con- 

s ant or at least at a .groulh slower than Ihe growth of 
the Hera urP of the fie.,;. This means that the pSage 

dec ease Sn.?'"'"' ""'ff' ^""'^'^ gradually 

access trv ' '° ^ proliferation of 

^aDs In 1"''^"'' '^'■^^'ed to cover the 

users' needs ""^ 1'™"'^"*'°" ^"'^ "elp meet the 

trarv ,t ?l„f?7 ' '^"^"^g^^ ^"'l disciplines. On the con- 

service, tn f 1 '".'""'^ ^* numerous access 

services to find the information desired. 

tim^Z'l'f.""^™,-/'''!,'™"''' or decrease 

user nd T/ '^^.,'^"? ^S"" ""^ ^""'d help the 
users and would lead to chaous eventually 

sure^s"fn^'L''°''"'"''y• 1° "f'" discussed, is to take mea- 
of primarV m'f °' *° '""i' rigorously the amount 
01 primary literature issued. It is generally agreed that 

kicreasin. ''""i".' P°P"l^tion of the world is 

and teZt.i " ''"^ °^ eivihzation in this scientific 
and technical era, as well -and, therefore it can be assu- 



..D;'v. th:.:ii the litei ahirc will iollow this growth in one ir ■ ui 
or iiiiothcr. 

Such a situation would seem desperate if there was not 
another factor, paramount in the overall system, e. dii- 
pHcation of efforts. 

Until recent vcars. the access Sxvrviccs have 'gf^."-sii ,u;* 
independent of ohk' anoiU^^f. This ha:s led: to great -t^m:^- 
ci^Xi-m M efforts^ withh'f nations and among nations. The 
sarnie material is acquired, analyzed, and processed in so 
many different places that it is impossible to obtain factu- 
al data on how many times a given paper is abstracted 
and indexed. The total amount of money thus spent all 
over the world on duplicating one another's efforts is enor- 
mous. This sum is considered to be many times greater 
than the amount necessary to render the best possible, coor- 
dinated services in all fields of knowledge anid to develop 
the highly sophisticated tools and modern: which 
would increas-i- sieasureably Hi:- ^ffiLieiu-' Jid d:u?.ctive- 
ness 0!= :^tirk .scrvS^-es. 

Also, the independent growth of access services leads 
to a situation where some fields are covered sev.:eml times, 
and the users in these fields have their choice of services, 
while other fields are covered only very par£;aiy or not 
even at all. 

The most obvious and, perhaps only, remedy Ic improve 
this overall situation is to develop cooperatiomand coordi- 
nation among the access services, as well as nvith the pri- 
mary pulbications, libraries, information centers,.:_and other 
organizations involved in the information tranHler chain. 

The idea of a centralized system is not new. One car- 
go back to an International Catalogue Conference orga- 
nized by the Royal Society and held in London in 1896, 
where the concept of a world center for information- was 
recommended which would cover all fields of knowledge 
and render all the possible services in all disciplines and 
all languages. As attractive as such a concept can be in 
philosophy, it is obvious that it is totally unfeasible today. 
It would lead to ask the existing services to stop all then- 
activities in order to be replaced by a mammouth center 
which would be created from scratch. One can easily con- 
template the size and complexity of the technical, financial 
and political problems which would be created. 
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Jlii-s !.-> XV y the concept of a world center lias now been 
M^laccd by the concept oi a world network of existing and 
future services. W itiiin sucii a world-wide nctworl^ the 

■ '(ho hl"Ti.-''"r Tr^^J'' ' cooperative manner to avoid 
he duplication of eflorts and to identify the gaps and co- 
\ei Iheni accordnig to a mutually agreed plan. 

5b. Bilaleral Activities and Associated Risks 

<^ovJ!'':JT'^ 1 '"T^'f^'''' "'-^^^1^ ^^'^^ so^^ie access 
sei\i OS have ah-eady started to develop limited networks. 

U;eiii]cai Abstracts Service, Internationai Atomic Ener- 

rn(^rrn\r^ '^^f^- '''^'^ ^^^^^ MEDLARS, VINITI with 
LUMECON countries, etc., have started to cooperate with 

•!in''-*''r'' ^!^"^"y, covering the same fields, and to 
dCNclop inlormation sub-networks, which can be described 
ill iin over-sunpi]fied way, as follows! ' 

In one country_(,r re-ion, ao-reements are made 
with local services which are responsible to co- 
ver the source literature from that count y or 
region and to send the information to the cent- 
^o «oiv;^ii'''"'^n-"'^; ^'^ ^^'^ ^^^'-vi^^'-s 

ihP m L f' ^?''c'^'"g purposes to answer 
the questions for their community of users 

There arc major problems which have to be solved 

MicJi as standaids, deimitions of coverage editori-il and 
rmat policies, etc., so that the infoniiatio^^v li h t cen 

sysL^^^^ specifications and requirements of the 

cenl^nf/'°'''T ^^''l ^^'''^ standards vary from one 
in Fran '^^^ n f'"'' ' '^^^^ ^^"^^^ started 

Although such bilateral and .multilateral agreements 



provomcnt in an over-all access system for Ihc long-range 
future. 

Let us imagine what the futui'e would bo, if the uncoor- 
dinated scheme was to be followed. There would e.xist a 
significant number of central services each covering a 
given scientific field (like CAS) or technical mission (like 
lAEy\), 



Central 
serviCdS 



CAS 




MEDLARS 




IAEA 1 



Loca? 
services 




: MS 
raHner 



France 



Csrmnny 



USA 



This seems fine until one considers that there is a sig- 
nificant percentage of duplication among the coverage of 
the central services as well as in the coverage of local 
services in each country or region. 

It is true that there have been some improvements in 
recent years. The French literature in atomic energy, for 
instance, is not acquired, scrutinized and processed at the 
same time by centers in all countries of the world. It is 
processed once by the French IAEA correspondent, but 
atomic energy literature may also belong to engineering, 
physics, biology, etc:, so that the total French literature 
may be processed by the various local French services 
several times. 

As has been described in 4 above, the central services 
in mission-oriented areas will proliferate, because of the 
increasing needs for project-oriented information. Thus, 
the duplication of efforts if uncoordinated in a country or 
region .will also increase and result in much inefficiency 
and eventual chaos. This is the most compelling force at 
work in favor of the concept of a world network of exi- 
sting services which consists of havhig «a mutually agreed 
upon, world cooperative plan». 

The above is not to say that bilateral cooperative ef- 
forts are useless. They help to identify the problems rai- 
sed by cooperation in practical, operating modes and 
lead to solutions of some problems and in some areas. 
From the evolutionary point of view, bilateral arrange- 
ments and sub-systems must be developed and encou- 
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raged. Bui what llie. information world really needs ]^ 
to develop cooperation among the central processing", 
access services in the framework of long-range, world- 
wide plans and goals. 

5c. The- World Plans 
5sl. U n i s i s t 

In 1966, the President of ICSU proposed to the Di- 
rector General of UNESCO that the two organizations 
jointly study the feasibility of a world science informati- 
on system. A Working Party in January, 1967 established 
the guidelines for the study and defined any «system» 
to be considered in the feasibility study in terms o! 
«a flexible network based on the voluntary cooperation 
of existing and future information services. » Furthermo- 
re, «feasibility» as used by UiMISIST, was understood 
mean political and economic feasibility, rather than tech- 
nical feasibility. 

A Central Committee and Working Groups were es- 
tablished. The most germane to access services were the 
joint UNISIST— ICSU AB Working Group on Biblio- 
graphic Descriptions and an Advisory Panel composed 
of representatives of large operating service most of 
which are also member services of ICSU AB. In addi- 
tion, the Central Committee commissioned studies on 
comparative classification systems in the sciences and 
another on the feasibility of an international register in 
machine-readable form of--scientific journal titles. 

The Centra! Committee also commissioned Prof. 
J. C. Gardin of CNRS to draft a final report of the 
feasibility study. This report is now being prepared for 
publication. Also, a synopsis or shorter version of the 
Committee's recommendations is in preparation. 

The Director General of UNESCO will 'convene an 
Intergovernmental Conference in -Paris in October, 1971, 
to consider the recommendations of the Study Report. 
It is expected that the Conference, after reviewing the 
recommendations in full detail, will make recommenda- 
tions to the Director General of UNESCO relating to 
their implementation. ' 



1G2 



The 21 recomm^- ^da lions in the Study Report wliicli 
have to do with potential information programs are or- 
ganized in five groups. Group one contains 6 recommen- 
dations, calling for the development of tools for systems 
interconnections. This group contains a concentration of 
recommendations of concern to the access services. For 
example, Recommendation 2 calls for the development 
of an International Serials Data System. Recommenda- 
tion 3 relates to standardization of elements of biblio- 
grapliic description. Recommendtaion 4 calls for a rene- 
wed attack on the problems of subject specifications in 
the sciences. Recommendation 5 concentrates on resol- 
ving some of the problems of incompatibility among 
systems. 

The second group is concerned with the support and 
the strengthening of existing services. Recommendati- 
on 8 follows since it refers specifically to abstracting 
and indexing services: «Ongoing experiments and pro- 
grams aimed at increasing the effectiveness of abstrac- 
ting, indexing^ and translation services, geolinguistical- 
ly disciplinary or mission oriented, sho:ild be publicized 
and encouraged. Particular support should be given to 
cooperative schemes resulting in an international sha- 
ring of the work and product of such service, as essenti- 
al building blocks of the world wide information network 
to which UNISIST is dedicated. » The accompanying text 
refers specifically to ICSU Abstracting Board as the nuc- 
leus of an action group to which UNISIST must look 
for the implementation of these recommendations. 

If the Intergovernmental Conference succeeds in its 
mission as planned, it may be anticipated that the sym- 
pathy and understanding of governments will be directed 
to the support of scientific abstracting and indexing. 

It is not possible to discuss here all of the recommen- 
dations of the UNISIST Study Report, and most of them 
have interest for the access services. One can only com- 
mend the complete -report to the attention of the reader. 
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ICSU AB Plans 



The ICSU AB is an international scientific organiza- 
tion which federatek xepXQsentatives of ICSU, represen- 
tatives of international scientific unions belonging to 
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ICSU (who bring into llic ICSU AB the point of view of 
producers iind users of scientific iiiforniiition) , and re- 
prosenlylives of llie miijor ope'-aling- access services ail 
over llie world. 

ICSU AB's role in a world information system con- 
sists in being- the specialized agency dealing "with abst- 
racting and indexing services. To this end the Board has, 
within the framework of UN IS 1ST, launched a program 
for a world system for abstracting and indexing services, 
consisting of three parts, covering; (I) Input, (11) Pro- 
cessing, and (111) Output, respectively. 

The concepts of the plan for developing cooperation 
at the input stage, which follows, have been unanimous- 
ly adopted by the Board in July, 1970, and endorsed by 
UNISIST. The Board is now working al designing in 
detail the studies necessary before implementing Phase 1. 

The world system for abstracting and inde.King ser- 
vices is defined as a conceptual entity; it can be thought 
of as an aggregate of the major, existing abstracting and 
indexing services all over the world. Thus, the overall 
input into this systen"; is the aggregate of the input of 
all the individual abstracting and indexing services and 
the coverage of these _ services. The ideal situation and 
Inng-range goal of the input plan will be the complete 
sharing at tlic input stage by the member services with 
no duplication and no areas of incomplete coverage. The 
ideal situation wdll also be such that each access service 
inputs into the system, bibliographic descriptions plus ab- 
stracts plus indexing terms which will be standardized and 
in machine-readable form. 

The ICSU AB plans to develop compatibility at the in- 
put stage, initially concerns the periodical literature which 
forms the bulk of material processed by the access servi- 
ces. In earlier studies, the member services have identified 
and established a relationship with journal productivity as 
falling into three classes; high (core), medium, and low 
(fringe) productive journals. It is contemplated that a cen- 
tral redistribution unit could sort, the information accor- 
ding to the «field code» (e. g., member services identified 
for certain classes of material) and send it to the intere- 
sted abstracting and indexing service. The central redistri- 
bution unit may not need to exist in fact, if after the stu- 
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dies it is dclGj-miiicd that cnch member service could more 
efficiently undertake tliis role. 

The first stage plan defines guidelines for cooperation 
among tlic member services in acquisition, selection, and 
exchange of documents for coverage by tlie services. This 
is aimed at eliminating much of the duplication in journal 
acquisition. 

Under this plan, tlie member services would assume the 
responsibility of acquiring and selecting articles for cove- 
rage from the most productive journals in their discipline. 
Articles in those jourtials that are of potential interest to 
other disciplines would be forwarded in microforni, along 
with the author abstract and a standard computer-rea- 
dable bibliograpliic description, to the apjjropriate other 
cooperating services, either directly or through a central 
redistribution unit. Responsibility for acquiring and selc- 
cting articles for coverage from a substantial number of 
journals of medium and fringe productivity will be alloca- 
ted among the member services according to the regions or 
languages or the journals. 

The necessary studies under way or to be carried out 
. in the near future involve (a) subject definitions and clas- 
sification, (b) overlap in the abstracting of primary jour- 
nals, (c) international serials data system or identification, 
listing, and controlling periodicals of interest to access ser- 
vices, (d) selection policies and overlap in coverage of 
individual papers within journals to assist in the develop- 
ment of exchange efforts, (e) editorial policies and proce- 
dures, (f) indexing approaches-similarities and d.ifferen- 
ces, and (g) detailed inventory of member services opera- 
ting systems to identify degrees of compatibility and con- 
vertibility. 

It is anticipated that these studies can be carried out 
during 1971-3 and elements of the input plan would be 
implemented by participating services in the interim as 
feasible. Further details of the Input plan are available from 
the ICSU AB headquarters in Paris. 

Finally, in regard to Part II, Processing, and Part III, 
Output, stages, drafts of these programs are underway. 
TI.Te time has not come yet for ICSIJ AB to launch such 
plans which need to be carefully studfsd and have the full 
support and back-up of its member services before being 
publish-ed; When these- are ready to launch, they ■will be' 
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niade as well known all over the worU- as had the «input 
plan», and comments from other organizations, compo- 
nents of the world system for information transfer wiN 
be as welcome as they were for this Input plan. In deve- 
^^c?? l^^^ ^'^'^"^ systems for access services, the 

IC^U AB would ,be pleased to cooperate with any other 
organizations or individuals competent to contribute to 



the work. 



6. Conclusion 



Access services may have, and will have, a brilliant fu- 
ture. The future development and improvement of access 
services will save time, money, avoid duplication of ef- 
orts and help human progress faster along the road of 
knowledge for the social and economic benefit of. all man- 
kind. 

Cooperation at all levels is a big venture. Tv,'o basic 
principles must not de forgotten when one tries t,- deve- 
lop mternational cooperative activities. No plan is croing 
to succeed, however attractive it may be, if each component 
of the network is not going to benefit, in one way or the 
other. And, secondly, a plan which would usually take 
two or more years Ic be implemented within a single or- 
ganization or nation, will take at least twice as long to 
be implemented if it involves international cooperative 
activities. 

Nevertheless, the opportunity for a coordinated interna- 
tional network of access services for the transfer of scien- 
tific and technical information increases each year. Major 
and current motivating forces are: 

a. Access services are eager to improve their systems 
and services. 

b. The financial pressures are forcing access services 
to consider ways of sharing rather than duplicating 
materials and resources. 

c. The advances in technology of communications per- 
mit sound, pictures and digital data stored at dis- 
tant locations to be made available rapidly and ef- 
ficiently with relative ease. 

Access services strongly believe in cooperative efforts. 
We hope to have convinced the readers of this paper. 
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INTEGRATED INFORMATION SYSTEMS 



1. Definition: general aspects 

A major prospect for meeting the present critical si- 
tuation in the domain of scientific communications — 
which is a natural corollary of the advancement of scien- 
tific and technical revolution is the development and 
practical implementation of a special type of information 
systems, known as Integrated Information Systesns. What 
is an integrated information system? 

By an integrated information system we mean a mul- 
ti-function automated information system which, once an 
information item has been fed into it, will repeatedly use 
that item in accomplishing all kinds of information tasks; 
it is intended for collecting, analytical-synthetic proces- 
sing, storage, retrieval, and dissemination of scientific 
and technical information to fully satisfy comprehensive 
meeting of the information needs of scientists and practi- 
cal specialists. In other words, exhaustive data on the 
contents and form of every scientific document are ente- 
red into the automated information system only once to 
be stored in it; this complete (exhaustive) description of 
the document is used by such a ' system as a basis - for 
producing all printed aids and conducting all types of 
information service that do not imply any additional crea- 
tive processing of that input information on al! kinds of 
request. 

An integrated information system is a complex whole 
made up of several simple systems, each designed . for 
some basic function and capable of independent opera- 
tion, but taken in an organic unity with the. other simple 
systems-being an inherent part of the integral complex 
which can fulfil certain new functions infeasible for any 
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of its conslilucnt simple systems. Tliat is to sav, the to- 
tal set of tile functions porfoj-med ■ by a correctly' designed 
integrated informcitioii system must always be greater 
than the arithmetic total of the functions that its consti- 
tuent simple systems can perform. An example i of sucli 
new function of an integrated information system is its 
capacity for meeting practically all types of ' information 
needs, whereas simple systems are u.'^^ially oriented at a 
particular type of need. 

It should be emphasized that an integrated informa- 
tion system is one system, that its subsystems and com- 
ponents must be fully integrated, that is organically tied 
m and merged together. These subsystems^" and compo- 
nents make up a single whole, indivisible without a loss 
of some spccilic property. 

A complete documentary integrated information sy- 
stem must comprise at least the following functional sub- 
systems: 

— selective dissemination of information, 

— current-awareness bulletins, 

— abstract journals with indexes, and 

— retrospective .search of documents and information. 
_ ^ We believe that the proper approach to building an 
integrated mformation system is not «i:nductive» ~ i e 
by identifymg the common elements of the simple systems 
bemg integrated which make it possible to join or tic up 
these systems into a single complex— but we think that 
tins approach should be «deductive» ~- i e by decompo- 
sing the information handled by each of the simple sys- 
tems being integrated into structural elements, identify- 
ing the non-repeating structural elements, and devising 
a new bigger system handling only the selected minimum 
of structural elements but capable of all the functions 
performed by the initial simple systems. Thus we obtain 
not a «mi.xture» of the simple systems, but so to speak 
their <<solutiion», which is characterized by less entropy 
than that of the arithmetic total of those systems. 

An integrated information system in principle resemb- 
«es Aaeccano — a mechanical construction game for child- 
ren, which is a set of standard • parts of a few patterns 
and a manual showing the child how to put together va- 
rious geans using- the s-ame parts. ...... • 
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The concept of an organic union, i. c. inlcjjiralion, of 
information systems descM-vcs a special discussion. Inte- 
gration is: a totality of events or actions leadin<^- to ilie 
end of existence of separate parts as individuni free enti- 
ties and to the emergence of a previously non-esistant 
'.lew entity possessing certain new unprecedented peculiar 
properties. The notion of integration is intrinsically con- 
nected with the notion of integrity, for without it integra- 
tion would have been synonymous to sijiiplc summation 
producing no qualitative change, that is the very notion 
of integration would have been senseless. 

It is not always possible to combine separate parts 
into an integral whole. For that, these separate parts 
must possess a certain sum of properties affording tlic 
emergence of a system of connections between these 
parts. In relation to biological systems Acad. V. A. En- 
gelhardt refers to such properties as .desmogenic (derived 
from Greek desmos — connection') 120, p. 8071. 

A system of connections between the parts is a sine 
qua. non of an integral whole. It is these connections 
which constitute the new entity distinguishing the inte- 
gral whole from the total of its component parts. «Kno- 
wing everything of what 'one' is and that one and one 
make two," we still do not know everything about what- 
'two' is, for some AND is added here and it is to be known 
what that AND ' purports», as L. von Bertalanffy writes 
in a half-joking way [20, p. 803], 

As a system of connections emerges which joins to- 
gether the former free parts into an integral whole, these 
parts to an extent lose some of their properties; the lattet 
are converted into the properties of the complex whole 
that are not identical with the lost properties of the con- 
stituent parts. 

Finally, the parts constituting the integral whole— as 
contrasted to a mixture — must be arranged in it in a fi- 
xed and regular order. 

To recapitulate, the characteristic features of an inte- 
gral whole as regards its interreations to the constituent 
parts are: 

— emergence of a system of connections between the 
parts constituting. the integral whole; 

— the disapperance of some of the part's properties as . 
the result of its incorporation in the whole; . ■ • ■ 
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the emergence or new properties in the integral who- 
le, differing from the properties of its constituent parts- 
and * 

— the regular arrangement of the parts in the integral 
who e, the determination of their spatial and functional 
relationships. 

These features have been ascertained by Acad V A 
Engelhardt, the father of a science-cognitive trend in 
the present-day biology which he baptized inteijratism 
f20 pp. 807-808; 21, p. 10-1 IJ.. The essential Ideas o^ 
that research trend have a particular value for those con- 
cerned with the theory of integrated information sys- 
tems. . 
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Abbreviations iiscd 
D — CRT dispJay witli n kovboard 
MP-imicroriches of oritrijial'sclent ific documents 
MT-m;ijrnetic tapes witi indexed search files 
spi— selective dissemination of information 
SP-sccondary publications (current-awareness bulletins. 
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abstract journais, etc.) 



An integrated information system 5iiioiild be distingui- 
shed from wliat is called an «information bank» (or «da- 
ta bank»). An «information bank» here implie-. a compu- 
ter — based information system whose main function is 
to cater to the information needs of scientists and practi- 
cal specialists using ready-made research files supplied 
to that system by one or more integrated information sy- 
stems. As a general rule, the «information bank» itself 
performs no analytical or synthetic processing of the 
documentary sources of scientific and technical informa- 
tion. The overall scheme of interaction of a major inte- 
grated information system with a network of «informa- 
tion banks» connected to it is shown on Fig. I. 

2. Multidisciplinary information needs 
of the contemporary scientists and practical specialists 

Integrated information systems have been called into 
existence by a social need stemming' from certain peculiar 
features of the organization and development of modern 
scie,nce and technology. 

An important feature of the present-day science and 
technology is the fact that their organization is based on 
at least three dimensions: 

— science and technology retain the traditional divi- 
sions into disciplines: 

— science and technology are organizationally arranged 
by the economy sectors and industries, which are not 
identical to the scientific disciplines; and 

- — science and technology are becoming more and mo- 
re oriented on multidisciplinary problems, necessitating 
the use of the methods and results of a great number of 
scientific and technical disciplines. 

f This means that to meet the information needs of the 
contemporary scientists and practitians one must be able 
to produce any disciplinarity pattern of the information 
aids and services, i. s. to group or «package» the sources 
of scientific information by any given combination of 

■ characteristics. This task is impracticable with the con- 
ventional procedures of preparing secondary publications. 

Ariother major feature of the present-day science and 
technology is the fast growth of its mass scale. The past 
decade alone witnessed, a 2.7-fold increase of the num- 
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ber of scicnlLsl- in llic USSR, which ^imonnled to 985, 200 
late ni 19 / 1 . 

Besides, by the same dale, the number of dipiomed 
o^^n'nnn'' employed in tlie nation^^l economy was 
^,bDU,000 while special isls with . higher and secondary 
special education tolalied close on 10,600 000 [131 hi 
.this context any plans for providing that huge mass of 
scientists and practicians by systems of selective disse- 
mination of information should be regarded as a vain 
dream. . 

^ For a lono- lime ahead, sccondarv publications of va- 
rious types and purposes arc to remain the principal me- 
ans ol individual mformation supplies to scientists and 
practical specialists. This conclusion is corroboriaed bv 
Uie results of a questionnaire survey of 4,000 Soviet sci- 
entists and practical specialists, conducted by «Literatur- 

"f^A/Tx^M '^'^^^^ results, computcr^processed 
at VINITI, suggest tliat appro.ximateiv 45% of t'^- '■es- 
pondents replied to the question «Through what kind of 
cnannels do you receive information valuable for you?» 
()y the statement. <'<Through abstract journals, express in- 
ormation bulletins and other published aids of VINITr» 
IJyj. But ever more rigid requirements are being laid to 
the secondary publications as regards their subiect com- 
position, coverage and preparation tirne. 

To be sure, every scientist or specialist wants infor- 
mation meeting his particular needs moulded by his 
persona] characlerislics (qualifications, menial attitudes, 
"^^f £,'^9 ^^''^ specifics of the problem he is concerned 
with This suggests that every group of scientists or spe- 
cialists preoccupied with a certain scientific or technical 
problem^ should uo provided with special secondary 
publications. The conventional preparation procedures of 
a new secondary publication presuppose a new proces- 
sing (indexing and/or abstracting) of the entire colossal 
1 ow ol published and unpublished doci'ment-- describino- 
the recent developments in science and technology Mo*^ 
reover, each time many laborous operations hay'- to be 
repeated all over again in re-typing, composing, proof- 
leading, etc. In other words, each new «packaging» of 
scientific and technical information tailored to new re- 
quests calls for a tremendous expenditure of - skilled ■^a- 
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hour, lime, and niatcM'ial resources, I ntc^^ra ted i n forma- 
lion svstcins seem to be the sole reined v lor lliat wastaue. 

To Sinn up, the pressures for a fnrtlier centralization 
of indexing and abstracting of scientific and technical 
literature are becoming ever more urgent today. The nia- 
in factors of that are: 

1. A rapid growth of the nimibcr of documents pro- 
duced in any field of science or technology, 

2. Tlie increase in scattering of publications in a dis- 
cipline, problem or subject field over thousands of perio- 
dicals pLd:)nshed in different languages (a result of the 
growing differentiation and integration of science). 

3. The growing number of jnterdisciph'na ry and mult.i- 
disciplinary problems and subject areas for which secon- 
dary publications liave to be provided. 

4. Tlie ever more acute need for a greater exhausti- 
vity of. coverage and reduction of the time-lags in pre- 
paration of the secondary publications, 

5. Tlic need for computer applications in quick comp- 
rehensive scientific and technical document processing and 
the execution of diverse information tasks. Tliese appli- 
cations" are economically efficient only |)rovided the coni- 
puter is used to handle very large document flows. 

3. Material premises to the building of integrated 

information systems 

Mankind's need for integrated information systems is 
sharpening, but to create or implement anything—the in- 
tegrated information systems in the case at hand— the 
community must possess adequate material capabilities. 
As a matter of fact, the community usually never feds a 
real practical need for what it is teclinically — at least 
potentially — incapable of creating or implementing. As 
Marx wrote, «mankind always sets itself only such tasks 
as it can solve; since, looking at the matter more closely, 
it will always be foimd tliat the task itself arises only 
v/hen the material conditions for its solution already exist 
or are at least in the process of formation» 111, 

The necessary conditions for the creation of an inte- 
grated information system are 

— tliat the information to be handled should afford 
this, and 
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— ihat the adequate iechnical facilities should exisi. 
The analysis of all kinds of information service shows 
liiat they are all based on — taken in different combina- 
tions — largely the same information items — author na- 
mes, document titles, imprints, classification numbers, 
keywords or descirptors, and the like, (Cf.. Table 1). Hen- 
ce, by entering into a big information system all the 

T n b 1 c 1 
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non-repeating information items the system needs to 
perform its particular . functions and keeping these items 
within the system for a pre— specified time span, one 
enables such a system to an efficacious performance of 
the functions of a set of simple information systems, per 
units costs of each function in that case are 'lower than 
with a simple (specialized) system. This is ensured by 
sparing the multiply repeated costly input of the same 
information items into different specialized systems and 
storate of those items in these systems. 

The technical facilities necessary for deyeloping an 
integrated information system must enable; 

— the storage of great amounts of information in a 
form permitting further machine handling; 
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— aulonialod selection, from the file stored, of all the 
information items possessing a given characteristic or 
combination of characteristics; 

— the necessary arrangement of the retriev: .i (selec- 
tee!) hi fori Mali on; and 

— rapid output of the relevant information in a format 
convenient to the user. 

All that became possible after the high-speed lar- 
ge-memory digital computer came on the scene. 

4, Computerization: an urgent need in preparing 
secondary publications 

Abstract Journals (AJ) with a system of indexes have 
been the principal medium of information supplies to 
scientists and f actical specialists for the past 140 years. 
The first sciemtfic AJ in tlic world seems to have been 
the «Bulletin universel des sciences et de rinduslrie», 
.started in Paris around the 1820s. The new service was 
warmly received by the learned readers, who soon began 
to refer to the new periodical simply as «Ferrusac"s «Bur- 
letin» in honour of its editor, who had formidable name 
Etienne Just Pascal Joseph Frangois d'Audebard, Baron 
de Ferrussac. The baron had been an army officer, but 
he had resigned to devote himself entirely to science, and 
had conceived the ambitious plan of creating a magazine 
which in brief reviews would give the content of all new 
books and articles concerning the natural sciences, phy- 
sics and mathematics. He had succeeded in engaging so- 
me eminent scientists as his collaborators, which ensured 
the high scientific standards of his AJ t9, p. 1401. 

The first specialized disciplinary AJ appeared in the 
field of chemistry. It was the «Pharmaceutisches Cent- 
ral — Blatt», initiated in Germany in 1830 and from 185G 
till its closure in 1969 published under the title of «Ghe- 
misches Zentralblatt». A thousand and a half AJ in^ the 
various fields, problems , and subject areas are published 
in the world at present. 

The purely informational functions, however, are but 
a part of tasks fulfilled by AJ in the present-day science. 
AJ is part and parcel of scientific and technical literature 
and hence of the social mechanism of science. The main' 
functions of AJ today are: 



<i) it is a medium of sigiic'ilizing to scientists and spe- 
cialists all scientific aiul technical jjubl ications in a given 
field or fields (current-awareness function); 

l>) it is a tool of retrospective search for scientific do- 
cuments in the various disciplines, problem and subject 
areas; 

c) it serves as a compensating means to surmount the 
publications scattering — tliis negative reverse side of sci- 
ence differentiation; 

d) it promotes a considerable lowering of the langua- 
ge barriers; 

e) it promotes the integration of science by enabling a 
scientist or specialist to keep abreast of the accomplish- 
ments in the allied fields of science or engineering and 
to make use of these advances in his own field; and in due 
time to identify tne «juncture» fields with their active 
«growtIi points» and to switch over to these fields, etc.; 

f) it helps to maintain the unity of science (through 
unification of special terminologies; suppression of the 
growth of the language barriers; eiaboration of pragmatic 
classifications or "subject schedules of sciences, etc.); 

g) it serves as a means of indirect evaluation of the 
scientific standards of publications (the publications of a 
lesser scientific informativity are not covered in AJ of me- 
rely signalized by bibliographic references). 

Only a half of the above functions of AJ can be viewed 
as purely informational, while the remaining functions are 
crucial to the advancement of science as such. Mence, the 
support and further improvement of AJ is a universal 
scientific comnn'tment. 

Supplemented by a system of current-awareness bulle- 
tins, AJ, as mentioned, are the major medium of informa- 
tion supply to scientists and specialists. However, the 
users today are no longer satisfied either with the current 
time terms of the preparation of current-awareness bul- 
letins and AJ, or with their subject scope patterns and 
schedules. What is to be done to cut down the time spent 
on preparation of the secondary aids and to issue these 
aids in any discipline, problem field or subject area? 

An inherent flaw of the conventional procedures still 
widely practised in AJ preparation is the fact that, up till 
the transfer of the material to the printers, the process unit 
being handled is a single publication (document). In the 
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printing phase, Uio pr(X-css unit is nn array of scconc'.ary 
(lociiiiienLs — ai^stracts, • annotations and bililiogrii j3liio. 
descriptions, which togeth.er make up the manuscript of 
the current AJissue. Prior to handino- over llie AJ manus- 
cript to tile printshop to the composed and printed, liic 
abstracts, annotations and bibliooTapliic descriptions are 
arranged (ordered) by the subjeet schedule adojited in tlie 
AJ scries in question, and serially numbered. The serial 
number serve to identify the documents covered in the 
A J and arc necessary for the indexes. 

Al publications i>repared to be included in the current 
issue of AJ lie by after they have been edited until the day 
of t:he editing-and-publishing cycle on which the entire ty- 
pescript of the issue has been fitted together to be lianded 
over to the printshop. If, however, one could pass on to 
the compositor and hence to the proof-reader each absti-act 
as soon as it has been edited, the Me-l.iy time would be cut 
down to a Jia I f-day-instead of 7 or 1 4 "calendar days, res- 
pectively, for biweekly and monthly AJ. Why then has this 
practice not been introduced? 

The chief stumbling block is this. The large flows of 
documents being handled (in VINITI about 4,000 per wor- 
king day) make it very difficult to make up the AJ issue 
and number the items in it when using conventional type- 
setting facih'ties: in fact it would take so much time and 
effort to ensure that the separate abstracts be not lost du- 
ring the typesetting and proofreading and later during the 
arrangement of these abstracts acco-ding to the subject 
headings, to lay-out the issue and assign serial numbers 
'to the abstracts, and to transfer these numbers to the card 
files of the indexes, that the procedure would be practically 
not worthwhile. 

The only possible way to in'troduce a system of typeset- 
ting and proofreading of each abstract rightaway after its 
editing is to use a computer featuring a memory of an ex- 
tremely high capacity. A ba'tch of abstracts edited in the 
previous day will be recorded every day on a machine-rea- 
be tantamount to the typesetting of ithat batch of abstracts, 
dable medium-punched tape or magnetic tape, which will 
Every abstract must be assigned with one or more num- 
bers of the subjedt headings under which that abstract co- 
mes. The proofs are read either immediately after the re- 
cording the abstracts on the machine-readable medium, 
o . * 
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Fig. 2 Alterna^live schemes of document-by-document 

input of texts ai«^ proof -fcadii^g 

Abbreviations used 

MTt — magnetic tape with Tx and input errors 
MT2 — inagneMc tape with corrections of errors 
AlTg— nifiguetic tape with corrected Tx 

Pi — punched tape with Tx and input errors 

Pj^ — punched tape with corri?!ctions of errors 
S—start 

Ti — typescript wftli Tx and input errors 
. Ta— typescript with corrections of errors 

Tx — text of edited biblingrapliic entry, abstract, etc. 
Txi— print-out or photo galley of Tx and input errors 



computer 



or on compulor-profluccd galleys of A J. The possible sclie- 
niatic procedures of tlociniienl-by-clocunienl eivtry of texts 
into the computer and of their proofreading are shown on 
Fig. 2. 

Computer , input of abstracts on a machine-readable me- 
dium should be done not on a daily but on a weekly or bi- 
weekly basis, so that parallel to the input operation one 
could perform selective dissemination of information. In 
this way reacly abstracts will be stored in the computer to 
be included in the next issue of AJ. 

When the day comes *to make up a current issue of AJ, 
the computer automatically arrays the abstracts in its me- 
mory into the necessary sequence. For that purpose Ihe 
subject heading nuriibers are used, which have been assig- 
ned to every abstract during the editing. Then the compu- 
ter performs the seria-l numbering of iTie abstracts and 
prints out the AJ issue in the form of galleys. A high-speed 
phototypesetter (PTS) connected to ihe computer can be 
efficiently used for that purpose, since only that kind of 
equipment features the vast set of diverse symbols (up to 
1,500) necessary for the composition of AJ in all fields of 
science and technology, PTS produces photo galleys, whicii 
are read by AJ editors. Through the computer, the neces- 
sary corrections can be introduced, whereafter PTS produ- 
ces the final photo galleys to be used for the make-up and 
photo offset reproduction. 

Moreover, application of computers and phototypeset- 
ters for preparation of AJ dummy is justified more than 
just by the essential redudtion of the time of AJ prepara- 
tion. Even more important is the fact that after the gal- 
leys have been printed out, all the information entered into 
the computer remains there, it is inefficient, however, to 
store complete texts of abstracts in binary form; for they 
take a tremendous storage capacity — an annual total of 
10'« bits for VINITI. Therefore, abstracts and annotations 
. are printed out from the computer and automatically trans- 
ferred on to roll film and/or microfiches. Wha't remains in 
the computer memory is complete bibliographic descrip- 
tions of the publications with their subject heading num- 
bers. This information alone permits automatic production, 
l^v.the computer-phototypesetter system, of the following 
ERXCO™^tion services: 
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— iliwcckly cn rreiit-awarcnoss hiillelins. based ov: -A 
classified sysleiiuitic nrrangcmenl of niatci-ial; 

— aullior indexes; and 

■ — permuted indexes to A J issues. 

With tlie conventional procedures one iias .to repeat 'Kr>" 
preparation of tlie typescript for each of the above servi- 
ces, as well as the composition and proofreac'ing opera- 
tion^>, notwithstanding tliat all this has l)ccn or will b;':- 
done for AJ. 

A'loroovcr, Ijy addiiig to the text of tlie alxstracl, in the 
coiu'sc of its writing and/or editing, certain information 
elements over and above the subject lieading numbers 
(e. g. keywords indexing the publication) one will be able 
automatically to prepare, on the basis of information on- 
ce fed into the com p Liter, not just A J but also other se- 
condary publications, and also to conduct a variety oi 
information services, including: 

— selective dissemination of information, 
■ — preparation of ciu'rent-awareness bulletins, 

— preparation of subject and author indexes (to each 
current — awareness bulletin issue, AJ issue, as well as 
semiannual, annual and five — yearly). 

— conduct retrospective searc'.i of documents, etc. 
Besides, the approach describet^ above brings an easy 

solution to the important and diff"K.uIt problem of prepa- 
ring secondary publications not only for the individual 
disciplines and fields of science and technology, but also 
for the interdisciplinary and multidisciplinary fields.- 
Owing to the accumulation of the texts of abstracts, an- 
notations and bibliographic descriptionb in the computei. 
memory, it suffices to produce a current — awareness bul-^ 
letin issue or AJ on any interdisciplinary problem to ma- 
ke the computer, controlled by the corresponding pro- 
gram, to glean out of the cumutative file the pertinent 
texts, arraj>ge these in the appropriate order and repro-, 
ducc th^se as photo galleys with the aid of PTS. Hence 
the costs of preparation of a secondary publication or 
some interdisciplinary problem will be equal to the cosi 
of the computer time spent on making up the text of the 
issue and on its reproduction in the form of photo gal-' 
leys, plus the costs of the printing of the run. No additio/ 
nal investment of intelectual efforts is required. 
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5. Design of integrated information systems: 
general principles 

We thus come to the concept of an integrated infor- 
mation system (IIS), the general block — diagram of 
wliicli is given on Fig. 3. IIS should be designed on the 
basis of the principle of one-time exhaustive analyti- 
cal-synthetic processing of cacli document — journal 
paper progress report, patent or inventor's certificate 
specification, c — to be done by highly skilled specia- 
lists, and Followed by one-time input of the results ()f 
that' processing iiito the computer, for the subsequent re- 
peated and multiple utilization of these results— for se- 
lective dissemination of information, preparation of re- 
quired secondary publications, as well as retrospective 
search of documents and information. In other words, IIS 
«packageL.> information materials according to the types 
of information needs. Tiie word «one-time» does not 
imply that at the IIS input each document is completely 
proce'^sed by one specialist. It only means that tlic pro- 
cessing fully excludes any dU;?lication of intellectual ef- 
forts: It is permissible tr> enter into the computer only 
such information items thai have not been previously en- 
tered into it, and cannot b<^. directly obtained or derived 
from those already stored. 

To ensure an exhaustive analytical-synthetic pro- 
cessing of every document to be entered into IIS, special 
forms are used, wdiich are filled in by the specialists ab- 
stracting and indexing the document. The forms are then 
edited and bcc^-me the inn ut docu.ments of IIS. 

A form u : in IIS must Include at least the follo- 
wing informa. >u items: 

■ — document working number; 

— document type (journal paper, monograph, patent 

specificatiu.i, etc.); ' , , . 

— Russian translation of the title (for foreign-langua- 
ge documents) ; , . - j, t + 

— complete bibliographic description the document 
(author's name or names, original title, imprint); 

— classification numbers (by the subject headmg sche- 

tlule) ; t . , . It 1 

— list of descriptors and/or keywords mdexmg the do- 

; cumciit; 
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— abstract. ■ : 
Tin's set of data seems siiflich'/it for ;\'-eparin^- almost 
any type of secondary publication anti ior meeting most 
typos of information needs of scienlists and practical spe- 
cialists. 

IIS embodies an cssciiti- -''y new approach to the de- 
sign of automated information systems, striving towards 
the (greatest possible saving;- of intellectual efforts spent 
on scientific document processing via complete elimina- 
tion of all duplicate operations. Thai <Toal is achieved by 
enabling a multiple re-use — in various combinations 
of the same information items, once fed into the computer 
and permanently stored there, for the automated prepara- 
tion of all kinds of secondary publications and for the va- 
rious types of information services. 

The development of IIS prc-supposcs a far better or- 
ganization of work in sci-tech information atrencies — 
highly qualified and skilled scientists and e.xperts should be 
concentrated at the system input to do nil (he intclleclu- 
al work of abstracting and indexing scientific documents 

At present US of Ihis lypc arc under development in 
the lollowmg mformalion centres and institutions of the 
world: in' the USSR— VINITI, for all the basic fields of 
exact, natural and technical sciences, as well as in a num- 
ber of central branch institiites of scientific and techni- 
cal information; in other countries Chemical Abstracts 
Service (USA), for chemistry, including chemical engi- 
neering, chemical physics and biological chemistry; fii- 
stitute for Scientific Information (USA), for chemistry 
and chemical engineering; American Institute of Physics 
for physics; Institute of Electrical Engineers (UK) for 
pnysics, electrotechnoiogy and control; Excerpta Medica 
Foundation (the Netherlands), for medicine- Centre de 
documentation du Centre national de la recherche scien- 
tifique (France), for all the major fields of science and 
technology; the Internationale Dokumenlationsgesell- 
schaft ftir Chemie (a corporation founded in 1967 by a 
group of chemical and pharamaceutic firms in FRG Aust- 
ria, and the Netlicrlands, and at present incorporating 
12 companies), for organic chemistry; Japan Information 
Centre for Science and Technology, for all major fields 
of exact, natural and technical sciences; International 
Atomic Energy Agency (Wien), for nuclear science inc- 
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ludiiif^ Jiiarg'innl aruas; Luralom Coiiii-i' for Infc iitation 
and Docuinciihition, for reactor technology, neutron pliy- 
sics and marginal and related sectors (such as nuclear 
biologry and medicine, the chemistry and metallurgy of 
nuclear materials, api^lications of radioisotopes, etc.). A 
similar system designed forithc jiroduction of «lmiex Mc- 
dicus» (a bibliographic index in mediciiTc, botli clinical 
and non — clinical, and l)iological medicine) is operatio- 
Jial since 19B4 at the US Nalicnal Library of Medicine. 

6.nS— VINITI: main features 

Practical work on development of VINITl's IIS was 
started in 1969. This IIS has no analogue iji the world; 
its specific features are: 

a) IIS — VINITI is devised as the central system of 
the country for cen'.ralized analytical-synthetic proces- 
sing of world's scientific and technical literature with 
exhausti\-e coverage; 

b) IIS — VINITI is a multidisciplinary system encom- 
passing all exact, natural and teclniical sciences and fi- 
elds; 

c) the development of IIS — VINITI is to be conducted 
in parallel witli the further ujifolding and improvement 
of all the existing activities of VINITI in the present or- 
ganizational forms. 

These specific features give rise to peculiar difficulti- 
es and problems as yet untackled in the world experience 
of design and practical implementation of such informa- 
tion systems. 

The designers of IIS — ^"VINHT must provide for: 

— compatibility of this system with the other similar 
systems both in the USSR ? 'd abroad; 

■ — capacity for processing 2.5 to 3.0 million publicati- 
ons a year, i. e. the lerJold of the biggest known IIS in' 
the world; 

— surmounting of the difficulties springing from pub- 
lications scattering as well as differentiation and integra- 
tion of science; and 

— preservation of the existing organizational forms of 
work paralleled by a gradual re-training of personnel in 
the period of IIS implementation.. 
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7. IIS— VJNITI: basi. 'unctions 



liS— VINMTl niusi perfonn the following basic functions; 

a) Indexing, annoiating and abstracting all world 
scientific and technical literature, and on that basis,— 

— production of current-awareness bulletins in the va- 
rious fields of science and technology, subject and prob- 
lem areas; 

-— production of AJ «Referativnyi Zhurna]» with inde- 
xes in the various fields of science and technology, sub- 
ject and problem areas; 

— dissemination, ;;n a subscription basis, of magnetic 
tapes with indexed retrieval files in the various fields of 
science and technology, subject and problem areas; 

— selective dissemination of information on sci-tech 
literature of the world serving a certain group of pat- 
rons; . 

— retrospective search of documents and informaticn 
*on individual requests in the whole accumulated file. 

b) Preparation of surveys of the type «Advances in 
Science and Technology»; 

c) Production — on users' requests — of copies of any 
scientific documents reflected in current-awareness bul- 
letins and AJ. 

d) Translation — on users' requests — of foreign scien- 
tific and technical publications into Russian. 

VINITI is to cease the publication of its express — 
information bulletins («Ekspress Informatsiya») , trans- 
ferring that function to the central branch institutes of 
scientific and technic;;! v-'.formation. 

The current-awareness bulletins wiTi cover all docu- 
ments relevant to the field, subject or problem area con- 
cerned, v/ithout any exception. The bibliographic descrip- 
tions in current-awareness bulletins will be arranged in 
a systematic order. Each current-awareness bulletin is- : 
sue will be furnished witli an autlior index and a list of 
source j>eriodicals. 

AJ should iiiclude only abstracts and annotations but 
not single bibliographic description. Any duplication of 
abstracts and annotations beiwee- 'Me different series and : 
issues of AJ is admissible; Each AJ issue must be furni- ; 
shed with an author index and an alphabetic subject in- ; 
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dew Cunuilalivc ifulox'es should be seini-anmial, <iiiniial 
and triennial. 

IIS--VINITI is to perform its basic functioJis under 
Ine following time terms: 

a) Publication of biweekly current-awareness bulle- 
tins— with a timc-iag of 20 fo 25 days after the acccf- 
sion of the source publications to VINITI (including the 
two weeks during which materials are accumulated for 
th ' current bulletin issue). 

■)) Coverage of at least 50% of all .'ncoming publica- 
tions in A J — after 50 days. 

c) Production of copies of all sci-tech documents on 
users' requests — in 10 days. 

8. The functional structure of HS— VINITI 

In order to cope efficaciously with the above functions 
observing the required time terms, IIS— VINITI must 
comprise 5 specialized subsystems (Fig. 4): 

1. Literature processing subsystem, responsible for 
the indexing and abstracting of world's scientific and te- 
chnical literature, and performing, on that basis,— 

— preparation of current-awareness bulletins, 

— preparatio:? of AJ with indexes, 

— selective clicsemiiiation of information, 

— prepartaion of surveys of the type «Advances in Sci- 
ence and Technology»; 

— on-subscription dissemination of indexed search fi- 
les (on magnetic tape) in any field science and tech- 
nology, subject or problem area. 

Most of these functions are being at present carried 
out by VINITI Institute. 

2. Printing subsystem, responsible for the publishing 
and distribution of information materials being prepared 
.in the Literature processing subsystem. This'" functions 
will be performed — as it is at present — by VINITLs 
Printing Plant. 

3. Information service subsystem (or «infurmation 
bank»), which will conduct retrospective search of docu- 
ments and information on individual requests. At present 
there is no such system in VINITI, the only exception be- 
mg the information retrieval system is information on 
science having a file of 35,000 documents. 
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4. Photocopy service sul)sysle!iT, producin.^ copies o* 
publications on users' rccjuests. This function is now fui- 
filled by the so-called Reference Information Centre of 
VIV TI's Printing Plant (RIC). 
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SDi — st'ieciive disseminntion of iin"tirr?:;i'iion 



5. Translations service subsystem, which will produce 
Russian translations of foreign scientific and technical 
documents on users' requests. This function will continue 
to be performed by the Ali-Uiion Centre for Translr/" 
on of Scientific and Technical Literature and Documen'- 
tion (till May 1972, called «Translations Bureau 
.VINrni£>). 
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It should bo emphasized the Informntion service suh- 
syslem («information bank») sould specialize un service 
of the users. The retrieval files will enter that system in 
a form ready for use from the Literature Processing sub- 
system. This will enable the staff running the Informati- 
on service subsystem to concentrate on the analysis and 
quick meeting of the information requests of scientists 
and specialists, and on the constant improvement of the 
efficiency not only' of that subsystem but also of the Lite- 
rature Processing subsystem. 

Photocopy service subsystem — in contrast to RIG of 
VINITI's Printing Plant -~ should be an organizationallv 
independent and nonprofit agency with the sole function 
of quick fulfilment of users' orders for copies of scientific 
and technical publications. For that, the Photocopy ser- 
vice subsystem must have a complete file of all docu- 
ments covered m VINITI's informational publications. 
This file must be on microfiches of 105X148 size (reduc- 
tion ratio — 24:1/; these will come to the system from 
the Literature Processing subsystem. 

9. Information-retrieval languages of IIS — VINITi 

IIS— VIN.ITI will make use of three information re- 
trieval languages: 

— the system of classifications that makes up the Sub- 
ject Heading List; 

~ UDC; and 

— descriptor language. 

The Subject Headings List is used for automated ma- 
ke-upeof informational publications, and also for selec- 
tive dissemination of information and retrospective retrie- 
val. The List has such a schedule and content which make 
it possible to issue current-awareness builetins and AJ 
issues by any desired combif^ations of characteristics,- 
provided this is economically justified, L e. there is a suf- 
ficiently large group of users with similar information ' 
requests. That m.eans that the List should be annually 
updated and revised to take cognizance of the findings 
of analysis of the performance of the system of selective 
dissemination of information and of the individual infor- 
mation requests. The List can be elaborated on the basis 
of the existing «Rubricator of abstracting journals of the 
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'-■^SR^ 110, 11], oFLated in the past 15 years of VINITTs 
ycUvities. nnd, apparently, providing " [or an adequate 
refaction of the information needs of a great proportion 
of Soviet users. 

Tiic descriptor language is intended for compilation 
of alphabetical subject indexes to AJ, and for selective 
dissemination of information and retrospective retrieval 
The vocabulary of the descriptor language is constituted 
by a fixed list of descriptors and any keywords, *<clected 
directly from tlie text of documents being indexed and 
functioning as descriptor modifiers. Only links will 
used in document profiles. The query profiles will be for- 
mulated using Boolean operators, AhlD, OR, NOT. and 
weights. 

To raise the efficiency of indexing and information 
retrieval, a system of compatible thesauri in various sci- 
entific and technical fields, and subject or problem areas 
is planned. 



10. US — VINITi: processing procedures 

The designers of IIS- AaNlTI proceeded from the fol- 
lowing assumptions: 

a) The main function of the Literature processing sub- 
system is to prepare secondary publications (current-awa- 
reness bulletins, AJ with indexes, etc.) and, on that ba- 
sis, — indexed retrieval file on magnetic tapes and discs. 
All other kinds of information service (except for the se- 
lective dissemination of information) should be conducted 
by the Information service subsystem using that file as 
well as by other specialized subsystems of IIS— VINITI. 

b) IIS should perform its functions on the basis of an 
exhaustive utilizatron of the single flow of scientific and 
technical publications. Duplicate subscription 'to foreign pe- 
riodicals and other sources should be reduced to a mini- 
mum. At the same time, losses of such source? during pro- 
cessing must be ruled out. 

c) Data on every document mu&t be entered ir.to IIS 
not as a single item, but by parts. First, bibliographic des- 
critions of the documents with subject heading numbers as- 
signed to them must be entered. This information takes a 
minimal time to prepare, though it permits to start the 
servicing of the users long before cur.ent-awareness bulle- 
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tins are issued 'to say nolhing of AJ. Nest, search profilcj^ 
oT the document {lists of keywords and lJDC Indexes) 
must be entered into HS, since the deep indexing of docu- 
ments lakes far more time than the prepara'tion of their 
bibliograpliic descriptions and their indexing by the Sub- 
ject Headings List. And only at the last step the texts of 
annotations and abstracts should be fed into IIS, for these 
lake the longest time to produce. 

d) There should be no time limit for the storage of in- 
formation in IIS. 

The technological process in IIS — VINITI will be app- 
roximatingly made up of the following sequence of proce- 
dures (see Fig. 5) : 

1. The flow of sci-tech literature passes through the 
conventional accessioning and registration procedure. Du- 
ring the registration, a special blank form is attached to 
each publication-journal, book etc. — giving the abbrevia- 
ted title of the publications, title code, year f.f publication, 
volume and issue number. 

2. Immediately after the registration (i. e. prior to 
marking-out), all periodicals and serials are copied in 
complete text on to 105X148 mni microfiches, which pass 
over to the document storage unit of the Information servi- 
ce subsys'tems, while the copies of these microfiches are 
passed to the Photocopy service subsystem. 

3. After copying, all publications appear on the new 
accessions display twice a week, alphabetically arranged 
on racks and available for scanning and partial marking- 
out by the specialists from 'the different editorial depart- 
ments of VJNITI. 

4. From the display, all publications go to a special de- 
partment responsible for selection and complete marking- 
out for current-awareness bulletins and AJ of all docu- 
ments included in these publications. Each document is as- 
signed with one or more Subject Heading number? (down 
to a most narrow subdivision). Besides, the marked-out 
documents ^.re selec'ted for the subsequent coverage in AJ. 

5. The source publications, already marked-out, are 
passed to the division where thty are copied- 1 : l:on to pa- 
per on Xerox-720 or similar equipment. Specially, for cur- 
rent-awareness bulletins a hard copy of the first page of 
each document marked-out (together with a copy of the 
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blank form carrying iJic imprinl (lata' bi Ihc source puh- 
Ii(.-;:t!ons) is ])ro{liicc(l. ■ • 

6. Tlie ,liai-cl copies of Ihc' bpcning 'pages of all docu- 
ments niarl<eci-c -it go to the division responsible for recor- 
ding on punched tape or magnetic ' tape all information 
ik^ms necessary for jauloniat'ed p/ oduclion of current-awa- 
reness huMetins. Each document is assigned a working 
number, If ail foreign titJes are to be supplied with Russian 
'translatiojis in current-awareness bulletins, then, prior to 
perforating puiicl^<ed tape or typing onto magnetic tape, the 
copies of tlie opening pages are to be for a siiort time sub- 
mitted 'to the editorial liepartmenls. 

7. Information recorded on punched or magnetic tape 
is entered into the computer and after the control and 
correction procedures is stored there lor the subsequent 
preparation of current-awareness bulletivis, AJ and for 
other purposes. Besides, two blank forms are automatical- 
ly filled in for each document selected for abstracting — 
indexation form and abstract form. Each form carries the 
working number and the complete bibliographic entry of 
the document plus its subject heading numbers. 

8. Indexation and abstract forms together wi-th the 
complete copies of tlie documents go to the appropriate 
editorial departments, where each document Is UDC- and 
keyword-indexed. The indexation form fllled-in is passed 
on to be recorded in machine-readable form for computer 
input, and the abstract form is sent out for abstracting 'to- 
gether with the document copy. 

9. The form reiurned by the abstractor carries an ab- 
stract of the docunient; it is subjected to scientific and 
literary editing, whereafter ft is to be recorded in machine- 
readable form for computer input,. 

10. After a certain number of documents have accumu- 
lated in the computer, it automa'tically arrays them by the 
subject heading numbers, provides with serial numbers, 
comp:.*;es an author index and other indexes and records 
the resulting text on magnetic tape in a form necessary 
for photocomposition. 

11. .A DIGISET phototypesetler is used for the com- 
position operation. After 'the galL?ys have been read and 
corrected, the dummy is made up. The final copy is signed 
for press by the editor of the current-awareness bulletin or 
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AJ issue, and sent to the printing plant for photo offset or 
elect rograpiiic reproduction. 

12. The run of the informationcil publication is produ- 
ced and distributed among the subscribers. 

The major advantages of the above procedure are as 
follows: 

rQ) Multiple typesetting and proofreading of the same 
mformation items is fully excluded (such as abstract num- 
bers, author names and complete bibliographic descrip- 
tions), vt is no longer necessary to keep numerous and 
cumberson^e card files, etc. The result is a tremendous sa- 
ving of labour, time and materials, and a considerable re- 
duction of errors. 

b) A capability is given for selective dissemination of 
information as well as a retrospective search of documents 
(by elements of the bibliographic descriptions and the sub- 
ject heading numbers) as soon as after 15 or 20 calendar 
days since the accessioning o^ ihe source publications at 
yiNITI (i. e. for, example, during the input of the file of 
bibliographic descriptions accumulated over two weeks for 
current-awareness bulletins) , 

On the 25th or 30th day after the accessioning of the 
source publication at VINITI, i. e. after the computer in- 
put of the indexation forms, that capability becomes unli- 
mited. 

c) Automatic compilation of author, alphabetic subject, 
patent and other indexes to each AJ issue becomes pos- 
sible. 

d) A practical opportunity is created for 'the issuance 
of secondary publications in any field of science of tech- 
nology, and any subject or problem area in very short ti- 
me terms. 



II. HS-VINITI: hardware 

The implementation of IIS— VINITI should be based on 
a se't of technical faciMties, incliading: 

— a «third-generation» computer in the. appropriate 
configuration, 

— a high-speed phototypesetter, 

— a fleet of keyboard devices for the recording of in- 
put information on punched cards, punched tape and mag- 
netic tape, 

195 



— equii^ineiil lor llie slorairc and rctricva! of the iiiio 
rocopics of input docurnejits (^second loop» of MS), 

— a ricel of clectrographic copiers of the type Xerox- 
720, -3(300, -4000, etc. 

Rough calculations show that. I IS — ViNlTi .needs a 
computer featuring: 

— a ramified time-sharing system, accomodating- up 'to 
50 or 60 terminals, including at least 20 CRT displays,, 
and a respojise lime of 3 to, 5 sees; 

— a powerful central processing unit featuring 0.5 to 
i.O mill son opera'tions per sec,; 

— a 'memory on interchangeable magnetic discs -.vith 
the total capacity of up to 600 million bytes and access ti- 
me of some tens of milliseconds; 

~ a core storage with a capacity of up to 400,000— 
500,000 bytes, with an access time of 1 or 2 |.<sec; 

— storage units on magnetic tapes (no less than 
8 decks); 

— the necessary mmibcr of input and output devices 
handling punched cards, punclied tapes and/or magnetic 
tapes; 

— alphanumeric output chain printers having a set of 
not less than 192 characters and a speed of 600 — 
1000 llnes/min.; 

— COM devices for direct output of information onto 
microfiches and roll film; 

— communication processor; and 

— visual displays and terminal teleprinters. 

The displays and teleprinters, connected to the compu- 
ter, can be installed at various locations across the co- 
itntry, at distarjces of hundreds and even thousands miles 
from the computer.. These terminals will give the user a 
direct access to IIS — VINITI and immediate reception of 
relevant information. Displays and teleprinters are parti- 
cularly effective for retrieval purposes. They afford the 
' user-system dialogue so that the request statement can be 

modified by the intermediate retrieval results to ensure the 
best end results. 

To provide a great number of users with an opportunity 
^ for holding such «dialogue» *with the IIS simultaneously, 
it must be based on a computer system with a developed 
timesharing facility. The user should not have to stand in a 
queue waiting until the computer has answered an ear- 
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lier requesi of anotlier user. Tinic-sharino- will be absolu- 
tely indispensable for llie cconomie;)Hy efficiei]! perfor- 
mance of I!S— VINITI and tlie prompt answering)- of requ- 
ests. 

12. The current status oi IIS — VINITI 

At present, VINITI has four computers (two Minsk-22 
and two Minsk-32). which are heavily loaded witli vario- 
us practical tasks (in particular, preparation of current- 
awareness bulletins in automatics, radio-electronics and 
chemistry), information retrieval in the field of informaliou 
science, preparation of various indexes, and elaboration 
and updating >.f abs'iract typesetting programs, etc. The- 
se computers have a sufficient total capacity for produc- 
tion of secondary publications of an overall volume equal 
to 457o of the totality of present VlNITI's secondary pub- 
lications. 

To raise the performance of these coi iputers they will 
be joined together — by means of buffer magnetic core s'lo- 
Pages — to make a single complex system, which will gre- 
atly expand tiieir technical capabilities. The merger of the 
coniDuters into a single svstem will be completed in 1973. 
In 1971, a' magnetic-tape SC-1060A storage was connected 
to the MlNSK-22 computer, enabling it to read out magne- 
tic tape sent from abroad and to record information on 
such magnetic tapes. 

In 1972, a MD-17 s'orage on interchangeable magne- 
tic discs will be connected (each pack of discs featuring a 
capacity of 7.25 million bytes) as well as an alphanume- 
ric Potter NCR-3502 chain printer \yith a set of 192 cha- 
racters. This hardware permits to work out the necessary 
software, accumulate the valuable technological experien- 
ce, and to carry on the necessary personnel training. In 
this way VINITI will be able in a very short time to put 
into operation a large third-generation computer envisaged 
for the implementation of the final version of IIS— VINITI. 

VINITFs DIGISET 50T-1 phototypesetter has been in 
regular operation since mid-February 1970. Its working 
speed is 300 to 600 characters/sec (depending on the size 
of characters). Some 800 different characters have been en- 
•-'ered into the unit's memory, including 140 Cyrillic and 86 
Rom^n characters (in fine and semi-bold founts). All the 
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y OS have been developed at VINITI. Current-awareness 
bulletins HI automatics and radio-electronics have been 
composed on the DIGISET machine for about two ^-ears 
Moreover, a few subject and author indexes ^to some of the 

thaT unil ""'^ "''"'"''''^^ ^"""^ photocomposed on 

Computer input is mostly on punched tapes produced 
on Dura Mach-10, Supertyper, Optima-527 and -528 type- 
writers. 

The debugging of the computer programs of compila- 
tion of current-awareness bulletins in the integrated pro- 
cessing mode, thai is on the basis of a one-lime computer 

nput of iiiformalion, is nearing its termination, The first 
photo galleys attest to ^Ihe practical viability of these 
programs already providing for a h'-.-her photocomposi- 

on quality than many of foreign 113. Before the end of 
Ills year, the current-awareness bulletins for all fields of 
Chemistry and chemical engineering will be switched over 

iQ7^o\' 2'^*'''"^°,^'^''''' ^^'"'^s, about 230,000 entries in 
ly/^;. bince 1972, current-awareness bulletins in all fi- 
InL?} ^^'^oma'tics and radio-electronics (11 series, about 
oU,UOO entries a year) have been started. 

Computerizing current-awareness bulletins is merely 
an opening phase in creating IIS-VINITI. Technological- 
ly it will be completed within the curreiit year. The se- 
cond phase will be the changeover to the photocomposi- 
tion of AJ. The computer programs for that photocompo- 
sition will be. written and debugged also in 1972. From 
i-/c5, at least one AJ series will be produced in that man- 
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13. Building a nationwide network of scientific 
and technical information on the basis 
of integrated information systems 

The objective'of IIS— VINITI is not confined to the 
raising of the efficiency of VINITI's scientific information 
activities, important as it is. 

AVe think that the creation of IIS—VINITI will affect 
i\v: entire nationwide system of scientific and technical 
information and give an impetus towards the creation in 
the country of an entire network of fully compatible and 
mterconnected functional and branch IIS, 
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Spcciiiliziilion of any kind of activity, iiicliKiiiiLr sci- 
cMilific iiiforinalion work, is a major means for raism^^ 
its cfficicncv, as it permits to develop a more acivaiiecd 
technolo*rieal procedure, make- a wide use of up-to-date 
mechanization and automation facilities, and to tram 
higii-skiile:! personnel. 

Scientific and teclmical information agencies are la- 
ced with two major calegjories of functions: 

— analytical-synthetic processing of sci-tech niforma- 

tion, and 

— in forma lion services. 

It appears, therefore, reasonable to specialize the all- 
union /functional/ and central branch information a^ien- 
cies on annotatint^, abstracting and indexing of scientilic 
documents, charging the information servicnig of scien- 
tists and other specialists on «information banks», which 
should receive their reference files in a lorm ready for 
use /e. g., on magnetic tapes/. That trend is already dis- 
cernible in the organization pattern of the system of sci- 
entific and technical information agencies in a number ol 
countries. . , 

The basic units of the nationwide scientific and tcvh- 
nical information network in the USSR are— and will 
be — the scientific and technical information departments 
(STID) in R&D and D&D institutions and the teclmical 
information bureaux (TIB) at large industrial enterprises 
(late in 1971, STlD's and TIB's across the country total- 
led about 9,000); in addition, there are the interdisciplina- 
ry regional scientific and technical information agencies 
(sci-tech information centres in the territories, regions, 
autonomous republics and autonomous regions of the 
RSFSR, 62; in the Ukraine, 6; and in the Kazakh Repub- 
lic 4; republican institutes of scientific and technical in- 
formation, 15), since these bodies are directly connected 
with science and production, for it is by them that scien- 
tists and engineers are served. 

Tiie regular operation of these information agencies 
is to be SLiiDported by tiie all-union (functional) institutes 
(10) and central branch institutes of .scientific and te- 
chnical information (82) /5, p. 13/. By functional here 
and henceforward we mean an information agency which 
fulfils a certain information function (one or more) in the 
interests of the whole nationwide system of scientific and 
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teciinica] information. An example of a purely functional 
information urgency is tiie All-Union ScicntiHc and Te- 
clinical Information Centre (VNTITs) resposililc for the 
collecting, processing-, storage, retrieval and di^^seminalion 
of information about al! started, implemented and termi- 
nated Rc^-D and D&D projects in the coimlry. VINITI is 
both a -functional (analytical-synthetic processino- of all 
published sci-tech literature, methodolooical f^uidance of 
all research toward improvement and advancement of 
the national information service network and coordinf-itioji 
of this research in the country) as well as a central 
branch institute of scientific and technical information 
(servicing the system of the USSR Academy of Sciences). 

At present STID, TIB and regional information agen- 
cies-are spending a great part of their efforts -not on ser- 
vicing scientists and practicians, but on abstracting and 
indexing the relevant documents, publishing their own 
abstract bulletins, keeping their own reference files, etc., 
that is on performing the functions which arc charged on 
the ail-union (functional) and central branch institutes 
of scientific and technical information. This is due mostly 
to the fact that STID, TIB and regional in.format ion bo- 
dies are often dissatisfied with the subject schedules, co- 
verage and/or time terms of the information aids publi- 
shed by the all-union and central branch institutes of 
scientific and technical information. 

If STID, TIB and regional information agencies were 
to be regularly supplied witii current-awareness and ab- 
stracting publications having the necessary subject sche- 
dules and coverage, and produced in the required time, 
terms — which could be achieved only via creating com- 
patible IIS in all the all-union (functional) and central 
branch institutes of scientific and technical information— 
and if provisions were to be made for quickly meeting the 
information requests necessitating a retrospective search 
and quick execution of orders for photo copies of the 
primary documents, then STID, TIB and regional infor-" 
mation agencies could largely concentrate on more ad- 
vanced forms of analytical-synthetic processing of scien- 
tific and technical information — notably, on writing of 
analytical surveys, developmental forecasts, information 
synthesis, etc., and on a more profound and exhaustive 



200 



study and nieelinp: of tlie individiinl information needs of 
scientific and special users. 

The oplinuun structure of the network of scientific and 
teclniical information a<rencies in the USSR can lie xisua- 
lized as follows. A nationwide system of functional and 
branch information processing centres is to he created, 
larf^ely corresponding to the existing all-union and cent- 
ral^iranch institutes^)f scientific and technical, infonuati- 
on. These centres will he responsible for the analytical — 
synthetic processinjr of the relevant documentary infor- 
mation, striving to^the greatest possible reduction of un- 
justified duplication of intellectual efforts spent in other 
similar centres: VINITI will process all published scienti- 
fic and technical literature, VNTlTs—all data on started, 
implemented and terminated R&D and D&D projects, 
central branch institutes of scientific and technical iii- 
formation — the materials of the «iipward» flow, i. e, such 
unpublished documents which reflect the results of R&D 
and D&D work, data on new industrial articles, advanced 
know-how, .etc. Indexed abstracts of scientific documents 
will be channeled from the information processing cent- 
res to the «inrormation banks» on magnetic tape. The 
main function of these banks will be to conduct informa- 
tion servicing of scientists and practical specialists. 

This approach means that every central branch institu- 
te of scientific and technical information must abstract 
and index only all unpublished documents from its parti- 
cular «upward flow», updating the reference file it keeps 
with fragments of ready-made search files necessary 
for the branch of economy in question. These files should 
be supplied on magnetic tapes from VINITI (for the pub- 
lished sources), and from VNTITs and other centra! 
branch institutes of scientific and technical information 
(on unpublished sources). The main goal in that case is 
to bring down to a minimum the unjustified duplication 
of intellectual efforts spent on the costly analytical 
synthetic processing of the same scientific documents, but 
not to reduce the necessary diversity of the secondary 
publications and services. 

It is necessary to build up in the USSR a rational 
network of «information b3nks», taking into account the 
economic regions pattern and organized cn an hierarchi- 
chal structural principle. The upper level in that hierar- 
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ohy will he ionned j)y (he «iiiformallon h.'inks» prcsendv 
uiidor development at the all-union (functional) and cent"- 
ral branch inslitutes of scientific and technical informa- 
on; the medimn level will he the information banks» 
o be created at the central bureaux of scientific and 
technical mlormation and the republican institr.tes of 
scientific and technical information (the regional bodies). 
The next hierarchical level ^vill be the information 
banks» created at the leading STID and TIB, whereas 
the lowest level of this hierarchv will be the reference 
tVo'^'-Iu;"" already exislin^r in all STID and 

MB. Although these collections ~ as well as the referen- 
ee mformation collections of the regional information 
agencies and of the leading STID and' TIB — are not vet 
called «information ban.':s», they virtually exist and fulfil 
all tfte principal functicvis of such banks 

■ The hierarchy of \Uc «information banks» is to form 
an automatic system of «filters», allowing to the next 
higher evel only such informtaion requests as cannot 
be met by the lower-level «bank». This system of «fih 
ters» will be indispensable for ensuring the high effecti- 
veness of the entire nationwide information network In 
other words, the practice should be introduced of accen- 
ting the requests only at the «banks» of the lowest hie- 
rarchical level, i. e. at the appropriate STID or TIB ex- 
cept for specially stipulated cases. That practice should 
be strictly ooservcd. A request will then be sent to a 
higher-level «bank» only by the starting lower-level 
bank if the latter is itself incapable of meeting it. Howe- 
ver, the response will be sent directly to the user, inde- 
pendent of the hierarchichal level of the respondent 
«bank». ' 

Every «bank» must continually study the information 
e tith It^'f ^' '^-cordingiy to update its reference ' 
He With the accessions incoming from one or more in- 
formation processing centres. n'Uit ui 
The «information banks>> of all the al)ove categories 
no V • f f^'-^^^ion^ng in the USSR. Th? task 
now is to ensure the necessary compatibility of these 
«banks. and o set the system of their interaction g in ' 
according o the principles set forth above. ^ 
fnr.Vnr possible to organize a unified system of in- 
formation processing centres gnd information banks, qh- 



ly if liicse are equipped wilii coinpalibic tlii rcl-geiicriilio'.i 
eomputers. Tlic computers must have si)ecial pcriplierals, 
including a large external memory and CRT displays; they 
must operato in the time-sharing mode. Thereby the op- 
portunity will be opened for connecting logetlier these com- 
puters — thx'-ough the usual communications channels — 
into one m.ified system, hierarchically organized. That sy- 
stem — thanks to teleprinter and CRT displays — will pro- 
vide practically every scientist or practician with a direct 
access to any «information bank». 

The all-union network o\ photocopying ceu'tres for 
scientific documents should have roughly the same locali- 
zation pattern, structure and interaction mechanism. The 
105X148 mm sized microfiche holds a great promise as an 
information medium and it should be broadly introduced 
into the practice of information work. At a reduction ratio 
; of 20: 1, one microfiche holds from 60 to 72 pages, and at 
! a reduction of 60 : 1, as many as 475 pages, that is a who- 
le book of the average size; if the reduction is 150: 1, one 
: microfiche holds up to 3,200 pages, that is 7 or 10 books! 
Such microfiches are usually kept in catalogue drawers, 
where one linear centimetre holds up to 40 of them. 

When s;v.:eial plio'to materials which do not contain sil- 
: ver (e. g. diazofilm) are used, the cost of microfiches is 
; by far lower than that of printed originals, to say nothing 

■ of the costs of their enlarged hard copies. If the users are 
provided with the sufficient number of inexpensive readers 

: and reader-printers, this will bring an efficient solution to 
: many involved problems: the furnishing of the regional 
centres with the complete files of the publications and do- 
cuments they need, publishing certain types of services 
directly on such microfiches, making foreign literaturcby 
far more available 'to the users, finding a drastic solution 
to the storage space problem, etc. 

Networks of «i nforma tion banks» and photocopying 
centres should be supplemented by a nationwide informa- 
: tion referral service, i. e. a network of agencies capable ol 
' givingthe user a cpuck answer as to which centre h.' should 
' refer to for scientific and technical information he needs. 
\ Besides, the existing network of «information analysis cen- 
. tres» should be substantially enlarged — these are bodies 
i created at the leading R&D and D&D institution and inten- 

■ fled to write critical reviews and analytical surveys, dcve- 
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r-ip. 6 Gcnci-al f iinclioiuil scliomc of fhc iiciliomvidc sdciKific and 
teclnmical irifonnaiion netwDrk based on integrated informatiOM 
systems (double arrows show information requests) 

^ „ ^ Abbreviation used 

1^2. Cj — photocopy ins centres 

n?^NTrrsi"!?n^''?"'''o '."''VV-'''' scienfific ;hu1 tccliiiic.'il information 
UlJNin— State Public Sciontific and Tcclitiic;il X,ibriiry 
'/^G— iiiforinntKjn unfilysis CLMitrc 
, ID| — «inf oniiation bitnks* 

JRC — inforni.'ition refurriil centre 
STID— scientific ami tcclinical inforniation tlci)art inent (at an R&D 
or D&D instiititioii) 
ViMITf \^'i"n 7' i'jformnlion bure,iii (at a bijr industrial plnnt) 

V N 111(1^ AII-Unlon R^^^^^^^^^^^ ,f T,,,,,,,^, Inforn,.li.n, Clns- 

^.\^l'n^rr":l^"i'','J ^'^'■^'"'ifi<: Technical InforniMlon Centre 
. t^^PI_Ccntr;il «t"seuicli Institute of Piitcnt Infornuitlgn 



work or scientiti Cl ch iA^Il^^ \''^ -K.lionwide „e!- 
veu on Fig. 6. I'ciinical informalioii agencies is gi- 

infoS,^'n'urpli°es(o7he' ^° l"-°blen, of 

fact (l,at wliile, on lie one l,a I ''^ ''''' »^<^ 
are literally flooded wHh - n SOv^"™<,nl agencies 

: unable to digest boca^^f of Us monsfro^^^^ ^'T "^^^^ ^'"'^ 
versify, on the oilier tlie slnfo ^ volume and di- 

receive the sufficient' amoun. ts of ,^f^.''""j' ''"'^^ 
I^repared and tailored to i°s neech anr f ''°"r """P'^^^h- 

. managerial decision making "^^ in 

\ liniiL°d'numberof'^s'ped"l Infon' """"''^'y '° "P - 
bordinated. respecli^eh' o U e USSpV"''^'^''=^ r''''^' 
Ihe State Plan'ning C^mmi tli!^SH?e"'r °' 
Science and Teclmoloev -iml ('h„ S • ,• Committee lor 
Academy of Sciences Fvni - Prcsiduim of the USSR 

of high^renown"J?,ou,d^be'"nHsred toTa'rth ''"l" ''''''''' 
activities of these centr..^ r ? P^'^^tzipate in the 

wed with the exclusiro auiho tf ^"^o- 
'io the supreme ffovernmont ,^ ^""^ supp yfng information 
iiiusthaveanoppoStv form f^r^Tl ^'^^ letter 

need from the in[3ion 

ation. ^nduon Dodies of any level or affijf- 

To ensure a fnllpt- ,,f;i; i- 
scientific and technical i o °^ available 
O&D projects, it seems sensihli t and 
and cffic oncy of hitrodnc ni- ii, ° ^^^amme the feasibility 
«defence» of a proposed n oFert T/^^ information 
concerned should be in ?hn .^r S mformation agency 
with the propel ent o the nroieH '"''f^^ responsible - tlon| 
a failure to use al avail ,hL -T- "TT?' "^"l consequences of 
mation at the ^tarl of a proiect ''^'^ '"'^ mfor- 



of theVonteX'rary l"JnVonf7,' ^il"''''^^ 

tasks of a considerable increase ^^th^ ^r'^*''"■ '^'"'^ 
arch and design and develoomelii efficiency of rcse- 

fi.ne .aken lo^implemen^tr^^l^i'ltro'f 'tli^tl^'^, °\i*c'l^ 



wo.; n.i ' I ^r" " '^°"'T^cling science and indnslrv iho 
wedNcbt ;iic the links responsible for llie Draclicyl iiiinl,' 
menu, , on of tl,e resuHs of science, for heirmal' scale in 
roduchon into industrial practice. To su mount the d,f' 

sl^o, d '^""^P''^-^' of different niea^ures 
sliould be underlal<en including the constant improvement 

of he performance of ttie natronwide system of scien iffc 
and technical information. For that, it s necessaTy fi^ 
njsh the merger of the information agencies into a system 
of mterconnecte.S and interacting subsystems, wh cli ts 
be based on the principle of spedalization, 1, such a sys- 
ern. the absfractmg and indexing of the «downward» and 
<upward» flo%yf: o, scientific documents will be carried on 
by he network of functional and branch integrate 1 info ' 
afon systems The development of these systems K 
nes In^hl^'^'H we believe, one of the mair^ 

lines m the advancement of the nationwide network of 
scientific and technical information agencies. 
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J. Vinkcn 

(The Netherlands) 

THE ACTIVITIES OF EXCERPTA MEDICA 



Excerpta Medica was founded in 1946 lo face the chal- 
lenge of the world's burgeoning biomedical literature. 
A group of Dutch physicians founded it as the nucleus of 
a reliable information service which would screen, select, 
index, abstract and disseminate, in the English language; 
the international biomedical literature in a most compre- 
hensive way. 

In those early days «a most comprehensive way» en- 
tailed the occupation of two floors above a shoe shop in 
Amsterdam's Kalverstraat. Now Excerpta Medica .and 
its subsidiary groups occupy more than .10 buildings in 
and around Amsterdam, including a computer complex in 
tile suburb of Osdorp. 

To keep pace with the ever-increasing speed of infor- 
mation transmission, and to take full advantage of the 
developments in the nascent computer industry, Excerpta 
Medica established Infonet. Infonet is a fast-growing 
division which spearheads all the progress in automation 
and computerization. It designed and developed the au- 
tomated system for the storage and retrieval of biomedi- 
cal information which was first operated by Excerpta Me- 
dica in 1968. 

Before explaining Excerpta Medica's system in detail, 
it can be indicated how considerably that system has 
evolved during the past 24 years by examining the exis- 
ting Excerpta Medica/Infonet structure, and the shape of 
things to come. 

Infonet has daugliter companies and divisions speciali- 
sing in different areas of study; 

— Dynaflow, which handles the applied mechanics, in- 
cluding software for experimental and theoretical research 
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iJtuI calciilntion in heal transfer, fluid dynamics, and 
one- iincl two-phase riowsystems. 

— Digityp, offers automated tvjjesetting, page mako-iip 
and software for book production. It is the graphical sys- 
tems division. 

— ■ Rescona Eng-ineering-, is an engineering- consultative 
tompaj.y which invents new automated production proces- 
ses, ir.struments for hospitals, etc. 

— IniGjiet Electronics is the hardware division, adapting 
computer equipment and peripherals to meet specific re- 
quirements; it is active in data engineering, cybernetics, 
etc. New -equipment is developed where adaptions tlo not 
suffice. 

— Eltrac is the division which anages the E:icerpta Me 
dice computer center at Osdorp, in Amsterdam. It is staf- 
fed by systems experts, programmers, and hardware engi- 
neers. '■ 

The center is based on an NCR 315 RMC processor, 
with its internal memory extendable to 160 k' and a very 
large magnetic card storage in the form of CRAM-5 units 
(some units are already reserved for Infonet's planned 
on — line networks, job scheduling, for batch processing 
with interchangeable decks, etc.; additional units, are 
available for other real time applications). To extend the 
memory even further, two magnetic tape units, each of 
32 kc, were connected. 

An NCR-32r switching computer has been installed 
to control on-line use of the databanks: up to 100 ter- 
minal connections are possible. 

There are programs interlinking the hardware' com- 
ponents of the configuration, input and output routines, 
retrieval operations — either for specific search requests or 
for the dissemination of cun-ent and/or retrospective 
information — and for the on-line use of the databanks. 
Infonet makes available a series of programs for sub- 
scribers to the Excerpta Medica computer tape services 
enabling them to make Ihe most effective use of the tapes 
in their own computer facilities. 

Infonet's new automated information networks 

Integrated Hospital System; Infonet plans to take thci 
strain off the hospitals in the Netherlands by automating 
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llie transmission of informjiHon within eacli hospiial £in(l 
l)y tcM-niiniil-linking tlic hospitals in a national network. 
All adiiiinistralive work, flassJfieation, patient rccortts, 
files of urj.|-ent data, and medical research will be stream- 
lined. 

So far, Ivifonel has linked its computer ceartrc with a 
visual display terminal at Leiden University IHospntal. It 
has proposed the integration of medical registration files. 
Work is now in progress to interlink the archives of the 
pathological inslilules of Amsterdam, Leiden and Rotter- 
dam, and the TiNO Institute at Rijswijk. (see figure 1 
and i*) . 

Jn^egrated Libraries System; (see figure 3) a group 
headed by the director of the Royal Library at the Hague 
is asses^^ving a system planned by Infonel for an automa- 
ted Dutch national bibliography and documentation depot. 



ExparimentaC Information Hetwor?< ^EM(CRlproject< 




Pig. \. experimental .information network 
«EM/CRI projcct» 

This syst.„^in — the Pilot Project for Integrated Cata- 
logue Automation (PICA) — proposes a new central cata- 
logue which will be consulted by scientific, university and 
general libraries in the Netherlands and abroad by tele- 
communication or interchangeable computer tape. 
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Fig. 3. University library network 
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PICA will be compatible with otlier library systems 
such as the AA/Vi^K li but unlike other systems umier de- 
velopment, which em]iloy sorting techniques, Infonet has 
devised n search pattern allowing for inter-compute/ 
conversation. This will enable the library user to formula- 
te the most precise definition of the publication which he 
seeks, by author, title, subject, form of ])ublication etc. 
Infoiicl's policy in constructing an information system 
calls for the most intensive co-operation of software and 
hardware experts on the one hand, ami the key specialists 
in the jierlinent field of information on the other. 

This ensures that those who will eventually be wor- 
king in co-ordination with the system have participated 
fully at all stages in the planning of that system. 

The effectiveness of this policy was clearly illustrated 
in the development of the Excerpta Medica system, which 
was shaped by continual dialogue between Infonet desig- 
ners and the staff of Excerpta Medica who now work with 
the system (see figure 4). 



i Systems analyst 
User I Programmer 



Information specialist | Computer 

systems design 
Search what li- Store wtiat 



Search how^- i-Store how 



Fig. 4. 
designers and 



Continual ciijiloguc bclwecn Infoncl 
rs and the staff of Excerpta Medica 

Why Exerpta Medica? 

The so-called information explosion has been with us 
a long time. We have had a rapid rise in the number of 
technical and scientific journals since the turn of the 
century. It is particularly interesting that those coun- 
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.tries, such as Japan and the USSR, in wliicb technological 
development had a later onset than the USA and Western 
Europe, are now experiencing- a rapid growth in the pro- 
duction of technical and scienliFic information. 

The biomedical area, with which Excerpta Medica is 
concerned, represents a very large proportion of the whole 
scientific literature field, a relatively constant proportion 
in all countries — contrary to the idea many people may 
have that most journals are technical or in the ptiysical 
sciences. The linguistic problems with which Excerpta 
Medica is concerned — abstracting and translating the 
foreign biomedical literature into Engl'ish — are increasing 
in importance rather than decreasing. Contrary to gene- 
ral belief, English is not becoming moie and more widely 
used as the language of publication, but actuallv reached 
a peak, in terms of proportion of journals, in 1920, since 
when its position has been increasingly eroded by the 
widening use of Russian, Spanish, Japanese and langua- 
ges of the «third world». Briefly, that is the scope of the 
problem which Excerpta Medica is tackling. 

Excerpta Medica is probably best known in medical 
communities throughout the world for its pulilication of 
monthly abstract journals (see figure 5). In addition to 
the 36journals of {he regular series, into which the field 
of bioniedicine lias been divided, another 20 or so «special 
services» are published on more limited subject areas 
such as Fertility Control, Atherosclerosis. Contact Lenses, 
Gram-Negative Infections, Epilepsy, Multiple Sclerosis, 
and the like. All of the regular monthly journals and some 
of the special services arc now «composed» entirely by 
computer, and all the information is stored in the compu- 
terized databank. Whereas the 36 journals of the regular 
series are distributed on subscription, the more highly 
specialized special services are sponsored by various phar- 
maceutical companies, societies or governmental organi- 
zations, and are therefore generally distributed free to 
a select mailing list. The computerized databank, provides 
bibliographies and answers to questions on demand. Ex- 
cerpta Medica also publishes textbooks, symposia, mono- 
graphs and the proceeding of congresses, and has assisted 
in the organization of symposia, provided translation ser- 
vices, and done developmental work in the audiovisual 
area (films and closed-circuit television). 



2)3 



r 




'source journals according to dlsclpiLne 



Fig. 5. Source journals according to discipline 

About 500 full-time staff meml^ers work In the Am- 
sterdam offices, plus the part-lime pliysician-editors, 
in addition to the 700 editorial board members and some 
4000 part-time abstractors who are scattered, all over; 
the world. More than 3500 biomedical periodicals, are re- 
ceived by Excerpta Medica from possibly 15,000 such pe- 
riodicals published in the world; his results in a flow of 
about 100 issues per working day, which yields about; 
250,000 biomedical articles per year. AH of these are clas-f 
sified, indexed, and stored as citations in the databank,-^ 
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and about 60% of them are abstracted (the most signifi- 
cant ones). Each abstract appears, on the averaf^'e, in 
about 1.6 different Excerpta Medica publications. All of 
the journals received are microfilmed in collaboration 
with the . Royal Netherlands Academy of Sciences, resul- 
ting in a microfiche file which now contains about 
20,000,000 pages. 

Each of the 36 regular sections has an intcrnalional 
editorial advisory board, totalling- some 700 physicians 
and other specialists in all, to collaborate with the execu- 
tive chief editors and the editors of the individual sections 
to provide direction and guarantee high standards of 
quality in selection, abstracting and indexing. 

Most sections have more than one editor and some 
have 3 or 4, resulting in a total of about 80 specialists. 
These specialists do all the selection and indexing and 
supervise the content of the abstracts. Excerpta Medica 
has a linguistic problem however which is aggravated by 
the fact that it publishes in English in a non-English- 
speaking country, so that most of the editors lack a na- 

; tive knowledge of that language. Consequently the lin- 
guistic supervisors (usually physicians of English or Ame- 
rican extraction, living either in Amsterdam or in Eng- 
land) edit all texts before publication from a linguistic 
point of view. We also require a corps -of translators to 
translate abstracts that have been prepared in other lan- 

; guages (often the original language of publication, some- 
times Dutch or German) into English. The translation 
bureau, which was originally established to satisfy the 
internal needs, has now expanded into one of the largest 
biomedical translation bureaus in the world, providing 
about half of its 2,000,000 words per year to outside 
clients. One of the characteristic features of' this service 
is that all' work, about 70% of which is into English, is 

! supervised by a second person with a native knowledge 
of the target language. 

In order to separate the very best of the source materi- 
al, the 3500 journals are sorted according to quality. Es- 
sentially, the world's biomedical literature is divided into 
4 roughly equal groups of journals: general medical jour- 
nals which may contain articles on any subject and which 
can be divided into «good» and «mediocre» general jour- 
nals, and speciality journals (also divided into «good» 
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or «iTiediocre») . The speciality journals are divided into 
the 36 current sections of Excerpta Medica. This iniorina- 
tion («Generai» or «Speciality», «Good» or «Mediocre») 
is coded in the databank, a pViorl per journal, so that a 
searcher can restrict his search to a limited serj-ment of 
the biomedical literature if he so dcsii'es. Excerpta Medica 
also provides partial coveraf^e of a number of general sci- 
entific journals, such as Nature and Science, which frequ- 
ently publish articles of a biomedical nature. 

Since there is some inconsistency in the use of the 
term «citation» as it pertains to a scientific article, the 
elements of the citation as used by Excerpta Medica 
should be defined. For every article, there is an Englisii 
title and (unless written in a different orthography) the 
foreign title, codes for the language and country of publi- 
cation, the first three or four authors, the author's institu- 
tional address, the abbreviated name and Coden of the 
periodical, the volume and issue number, pagination and 
year, and the year of input into the databank (this is gi- 
ven by the first two digits of the .production number)/" 

Processing of source journals 

All of the source journals are first microfilmed, cover 
to cover, in conjunction with the Royal Netherlands Aca- 
demy of Sciences, and are stored as microfiche. The jour- 
nals are then brought to the editorial department where 
they are first examined by the assignment editors. These 
assignment editors make a preliminary assignment of 
each original article, including editorials and' letters to 
the editor, to one or more of the 36 sections of Excerpta 
Medica; thus, by writing the section number or numbers 
on the article, they provide what we can call the first de- 
cimal of the classification. The journals are then made 
accessible to all of the section editors for 5 days, during 
which time they can add their section numbers to other 
articles which may not have been assigned to them origi- 
nally. This 5-day period also gives the editors of the spe- 
cial services (sponsored SDI journals) the chance to se- 
lect articles for their 'publications, which are intended to 
be reports of only the «cream» in their selected fields. 
From the preliminary assignment stage, the journals are 
sent to the coding department where the citation of each 
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Uof is 11 1 '-K^;ocl onh, paper lape l,y P]'^-''-"^ J^;^ 
rr, into the compulei- with the sccdoii nunil)ers 1 

0 nn vides an i.Pmedialc print out of eacli day s nipi.t 
&'is llien sent to ll.c proofreading. departnu-rU along 
w ih II e or "inal journals. Any correclions necessary a e 
ma c directly in the computer memory by means of a co - 
ree ion Dro ram after w iich the citations are permanent- 
y^en 3 e Mnt;, the databank and ^^^signed a prod^- . ' ' 
nomher On a weekly basis, the computer pi mis a sciies 
of CO posilc forn,s, \he abstract-index form, for eacn 
citation and each section to which it has been ''^ ti"": • 
eaili nart 0 this form contains the full citation of the ai- 
tie e P u ti e section number. The journals are hen to n 
ap'u-t and each set of abstracl-indc>: forms is clipped . 

the corresponding article .,H iHi=>d are 

These articles with abstract-mdcx toini 
then sent, in turn, to the classification 
of each interested section. The chief edito. of each sec 
Hon beginning with the primary section, , ^ 

bin vs- classify the article inio one or more of the s ib.a- 
' gor es of h'; - section, and decide wd.etlu.- he waiHs^ 
abstract of the article to appear in his monthly 1^"' '^i; 
The article is then passed to the index editor (ii^siial a 
different person), who assigns the primary, secon 
and item-index terms. As soon as tl e 'i'^,; J'^P,^ 

classified and assigned primary indexing ten s by I e 
first section editor, the appropriate form is lo'" " 
sent to the computer, before any work is done onjl^ f ^ 
slract The composite form provides a separate detaUiauie 
shP-^t for classification numbers and primary indexing 
erms fo'r ead'seetion so that these may be -ri en a, 
entered into the databank without awaiting fu the, seco 
darv indexing. If a search is done whue an aiticle is stm 
heinV Drocessed the .searcher at least has a chance ol 
re rilvi^g a cTassified and partly indexed cita ion an 1 - 
er a fulTy-indexed citation, while tne abstract is s ill ■ 
nrenaration The abstract form is then sent, wi h the ai- 
?icle to he editor of the primary section, who is respon 



sible for preparing the abstract. If the primary section 
editor feels that the article is not worth absti'acting, then 
the editor of the next section receives the responsibility. 
Assuming that at least one editor wants an abstract lor 
his section, he sees to its preparations, cither by writing" 
an abstract himself or by haviiio- the article sent to one 
of our .4000 medical abstractors all over the world. In 
many cases the abstract will then have to be translated 
into English, and in most cases the English version will 
have to be checked by a linguistic supervisor. Eventually, 
the entire abstract is also fed into the databank. 



Editorial ]}epartment 



2: 



CompwterData bank 



Citations 



Oripnal article 

Ciassificaiion forms 



MuMiple classifications 



indea* forms 



Index terms 



Abstract forms 



Abstract 



Pl/BLiCATWHS 

Fifj, 6. Pmccssijig of arMcles for input of 
appropriate informa.tion into computer dala bank 

Altlioiigh the classification and indexing- is done in- 
dividually for each section, the same abstract which has 
been prepared for the primary section is used, with a few 
exceptions, for the other sections. At any stage, citation^ 
with or without abstracts can be printed out, as hard 
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copy or on magnetic tape, in response lo scaur, ^^V'f'ts 
When sullicienl abstracts lu.ve l)ecn accumulaleil in a 
section for Ihe monthly journal, the computer automat - 
cany com OSes the pages, assigns abstract 'UHnl^ers n 1 
naee numbers, compiles the author and subject nclcNes, 
nnd ore luces a filn whicli can be sent lo the pruitcr One 
;ri'e ■ a?acl ri tic features of the ENcerpta Mechcal oi^e- 
rat n i he constant interaction between the comput r 
and the editorial staff. This is an example of a ^^"1 lel^'Y 
integrated information storage and •■<=l'-'«;^' 

An information system is only as good as ts m lex, 
whic epresents the handles by which the mformat,on 
M be r lieved If the indexer and the searcher w'e e 
a way the same person, then almost !'"V' itft' tL'nS 
would suffice, but the problem ,s precseUy /.^f ~ 

"i,";^p;::i'lti:;'''oa:::%:n7r;mfe^^^^ 

s-:S^ U^e tu'to" s'-o" Scerpta Medica develope. a^^ 

indexing system which works by means o three " ei^m 

nnroaches- the subject index, the classification system, 
and the em index.' The classification system and . em 
index are a priori, fixed or semi-fi.Ked systems wh, 
the subject index, which provides tne most ^P^'-'f'^/^^_ 
-ievali s a free a posteriori system which 1.3 . two e 
veil -the primary terms and the secondary terms. All in- 
dexing in E.xcerpta Medica is done by medical specm sis 
and the goal is to provide access on either a b oat, o 
highly specific basis, according to the needs of the seal 

cher (see figure 7). 

ismenr mcEssiHo 



Siibiect Indexiffj^ 


Classification 


Item IndexifiQ 


MedicaC speciaCists 
Open! free cfjoice of wcrds 
Two ients of weigH 
Automatic permutationof 

primary terms 
Automatic thesaurus 
lOOOD prtncry concupts 
Sep tt) i no limi ta tions 


Medical speciaUsts 
Open endedi 
Decimal 

PotyHeraxctiicai 

2OO0 categories 
Maximum depth s 5 


Medical speciaUsts 
CBosed 

999 possibiiiiies 

categories 
Maximum dcptf^*10 



Fig, 7. Document processing 



r)J!-r r ""'^ °l each section have specified a detailed 
classification systcin with a possible depth of 4 decimals 

"^\J\Vmr'T^^^^ ^'^^ ^^^tions total 

c bout 3,000 pigeon-holes into which anv article 



class f.ica ions can 'be created rnnvUme is ol - 
erarchic in structure. A maximum 6f 5 d fferen c a i - 
cation numbers can be ^,nven to any article wi hi i each 
sec ion; of these, tlic first determines where the al str^c 
^vl appear m the monthly booklet, but all a?e t^ua ly 

n: nt on 7,n /f''' ^^^^''^^val by means of ciassi- 

ficat on number is a very powerful tool in many cases in 
which tne search question coincides with a subcater/ory 
The Item index is a list of terms which are consicrered 
to be too broad for use as primary index lerms but which 

ic blc to secondary subject-index terms, which are not 
thesaurus-controlled and are therefore genera Iv no ' com 
pu er searchable, to which numbers hLe be n 'sien ' 
The item index is a controlled list of secondary concepts 
with a maximum of 999 numbers, only about 55 of w'fich 
c^.;e now m use; up to 10 numbers cair be assio-ne i to a 

n -V tio s udy>>, «rat». «modeb. which serve to specify a 
or'inrlo";^''"" '^^'ir'^'^on with the appropriate ubj- 
ecMUdex terms or classification numt)ers. The item 
uukx IS mtended purely for computerized retrieval an 
does not appear m the monthly journals 

caily as possible, much more specifically than woud be 

maximal use of the computer while keeping the intellectu- 
al work in the hands of the medical specialists The 

?ns rnntrn)'^^ ^^'^^^^'y ^^^OUt 120,000 thesaU- 

nis-controlled primary terms -each of which is stored 
ni the databank in the form of a 7-digii number ^lus 

IIa~1 VtlTrPnn^' '^.^^^^^ terms to narrow terras - 
and abou 2o0.000 synonyms. These synonyms are also 
.'^J'^^ databank (the thesaurus file or MALIMET) 
so that the indexer or searcher need not consult the the- 
HnanLi f "a^^^al language to express his ideas, 
language which the computer will translate into accepted 
preferred terms. There is no limit to the number of index 
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terms which can lie assi.ci'iictl to an arliclo. The indcxer 
freelv assiti'iis anv sccoiulai-y leniis which inav occur to 
liini, sometimes forming a sort of telegraphic mini-ab- 
stract. 

When it comes lo printing llic sul)ject index in the 
montlily journals, the primary terms are automatically 
rotated so that each appears in its proper place in the 
alphabet, followed by the other primary terms and then 
by all the secondary terms. Thus a search «by hnnd» for 
any (jf the preferred terms \\'\\\ unearth all the other re- 
lated indexing terms. Significantly, all indexing at Ex- 
cerpta Medica is done from the entire source article, not 
just the title or the title with the abstract. 

The synonyiTi problem, a seidous stumbling block for 
any free indexin.g system, has been solved at Excerpta 
Medica by the development of MALIMET, the Master ilist 
of Medical Indexing Terms, a computerized thesaurus 
which is far loo comprehensive to be used by hand (the 
computer print-out is a pile about two meters high) 
but which automatically translates synonyms into prefer-, 
red terms. For example, the word «cerebral tumour», i'n 
the British spelling, is automatically transformed into 
«brain lumor» by the computer. Only «brain tumor» w^ill 
appear in the journals, and only the numerical equivalent 
of «brain tumor» will be stored in the databank. 
MALIMET was originally built up by asking each of the 
section editors to provide a list of primary indexing terms 
for his field, along with as many synonyms as possible. 
After eliminating the internal contradictions, these 
25,000 or so primary terms were then fed into the compu- 
ter with their synonyms. During the next two years, whe- 
never the computer was offered a term which it did not 
recognize, this term appeared on an «error list», which 
was then processed by the specialist index editors. Every 
term on each weekly error list had to be disposed of either 
by entering i as a valid new primary term, or as a syno- 
nym of an existing preferred t-. rm (so that in the future it 
would be translated automatically), or by correctiBg a 
typographical error. In this way (and the process is still 
continuing), the number of preferred terms grew to about 
120,000 and the number of synonyms to about 250,000, 
The only type of preferred term still being added in signi- 
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Ficaiil niiiiil)ers nrc drug names, hiii the sviionvm lota I i.^ 
expected to tvo nhmii 500,000. 

Examining tiie ory-ani/,ation of our conipuler files 
{see figure 8) in a little more detail, let us first look at 



Thesaurus file orgs 


mization 


output filet 


PT number to PT 


0000001 ••■ 


■^PTp 


0000002- 


•-PTk 


0000003- 


-PTa 


0000004- 


-PTe 


0000005 


-PTm 


0000006- • 


-PTh 


input file 2: Any term to PT number 



PTa NoA Ai Aj Ad see also.. 
PTb - rvJo B - B, B, B3 see also.. 
Syn-A, " NoA PTa 

PTc ^ NoC ► C, a see also... 
Syn. B, - Mo B - PTb 
Syn.C, NoC ► PTc 

PTd - No D ^ D, a D. a see also. 



Fig. 8. Thcsnurus file organisa'tion 

the two files required for our thesaurus: an output file in 
which the preferred tsrms are listed in numerical order, 
and an input file in which all preferred terms and syno- 
nyms are listed in alphabetical order, followed by the pre- 
■ferred term (PT) number. How does this operate in 
practice? Any term chosen by the indexer is transformed 
into the proper number and stored in the databank in nu- 
merical form. During the retrieval process, the search 
terms are again translated into numbers, and the nume- 
rical then transformed into analphabetic output by means 
of the output file. 

All of the informaition in the databank is stored in 
three different levels, each representing a different degree 
of accessibility. The first-level or rapid-retrieval file is 
characterized by its short, fixed-length terms,' permitting 
sequential organization and rapid access. This file con- 
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tains the primary and ilem index terms {which, of course, 
are stored in numerical form) plus the classiiicaiion iiunir . 
bers, Codcn code and year of entry, and finally the «add- 
ress», or coded location, of the rest of the citation and 
abstract data in the other files. A search of this file_ is 
rapid but can only yield numerical information, which 
must then be used as the l<ey to obtaining the entire ci- 
tation or abstract from the second and third levels. 
A search for a particular author, since this is not a fi- 
xed-length term, is thus slower than a search by primary 
index entry (sec figures 9 and 10). 

TtiEsaurus: system's supervisor 



output I pt 



output file 

pt numlsrto pt 



retrieval ung term^ 



input file 
any term to pt numier 



Iniexer [any tern 



Hot rtuir 



4pt ffumSerTl 1 pt nuwbers\ 



pL numhef] 



DATA flANK 



Fig. 9. Thesaurus; system's supervisor 

' The organization of the files can be explained in a 
slightly different way as follows. One magnetic card is 
devoted to a sequential address file listing production 
numbers together with the «address» of the citation, index 
terms and abstract for that production number in the 
other files. Another file is a sequential listing of citations 
in numerical order (since the citation is fed in at the time 
that the number is assigned); these citations, however, 
are variable in length. Another file consists of production 
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File organizaiion In data bank 



Primaru index 
Hem Utdex 
Cla^.^sifi^atLun 
Journal cods 
Year cf eniry 
Address 



FIRST LEVlL 

7. lO^'oi/.Tihols per udir 
^5r!0vt^fized tenf^lli items 
^se{iuentL!zUy organized 



Year of entry 

Autlior's name (s) 

Year Of puULcation 

Country Of origin Of article 

Originat languaoe Of article 

Coaen code of pehoiLcai 

Section numier Lndicating medical speciztty 

Chapter namoers in classLfication system 

Primary indexing terms 

Secondary indexing terms 

Hem indexing terms 



SECOND LEVEL 

per ijear 
* ssquentlal or random access 



Complete citnUon 
All indices 
CtassCfCcations 
Abstract 



THIRD LEVEL 

*M. 10^ symbols 
per year 

* randomly organized 

* ICR AM 5 decks 
per year 



Fig. iO. I-iio organisrition in ciata bank 



large 



n urn- 



numbers, listed at random, each followed by all the pri- 
mary index terms (in numerical form) and then by all 
the secondary index terms, and a final file consists of ran- 
dom production numbers each followed by an abstract. 
Note that the index file is not ir»verted, providing for 
much greater economy of operation when a 
ber of searches are run simultaneously. 

The drug literature service or Drugdoc (see figure 11) 
is run in many ways just like any other section of Excerp- 
ta Medica. The section is very large, however (represent- 
ing as much as 25 7o of the whole databank), and no 
abstracts are written specifically for it (although many 
of the articles may be abstracted for other sections). It 
is intended primarily as a computer service (although a 
printed, abridged publication called the Drug Literature 
Index is available). Input into the system for the drug 
project is the following information pertaining to every 
drug and chemical compound cited: the full citation of 



the article; the 



generic 



name of the 



drug 



or compound; 
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Ilio expcriiiUMilal nnnio aiid/tir voik\ if :ni cxpiMinieihlMl 
(iru^^ tlie Inuic, or I.m'liiuI, name; [he cluMiiica! luiine ol 
the druo-; chemical strLiclurc itifi>rmciliL)n (Wiswesser Li- 
ne-- Formula Chemical Notation) ; clinical indications 
and contraindications; clinical .-md pharmacolo^^ical ef- 
fecls; mitoward druo' effects; the name of the nianufactn- 
rer and the location of the manufacturer. This informa- 
lioii is extracted from 'lO^OOU articles a ye^ir and more. 

Brag literalare servLce 



^3,GD0 jounmis 
^^lOOMO iivUcies 

^ selection 



^![},000 articles on dra^s 



IciassLfication 



pharmacDiogy indexing 



cfiemicai indexing^ 
v^iswess er codLngl 



I magnetic tape \ [searclies I j . . , 

L_i ^ Ij ! I ! medical {.ndextng 

L. 



I s.n.i.l- 
I ^ J 




^ advents reactions tLUss (A.R.T.) j 



Fig. 11. Dnirr literiifurc service 



The articles are indexed particularly deeply from three 
points ()\ view by three sets of specialists: medical, phar- 
maceutical and cliemical. M^my chemical journals have 
been added to the collection of biomedical and related 
journals to afford better coverage in this section, th«^ pur- 
pose of which is to include articles describing biobt£>-ical 
effects of chemical agents, particularly drugs or po.it-ntial 
drugs. These articles are classified into 186 pharniia ome- 
dical subgroups, and are then indexed by a physMan, a 
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pharnKicologist, niul a chowhi wlio also provides a Wis- 
\yesser code. For an evtMi more coniprelinisivt' documenta- 
tion of drug literature it has i)(.rn ilecKknl to use a clienii- 
cal fra<^ineiitation code too. The infoi'niation is storeil to- 
j^elher with all other Ilxcerpta Medica iuforniation in the 
(hitabank. where it is available for retrieval in response 
to search requests, and is also available in the form of 
regular computer tape and/or harcl^ copy. 

A special sub-section of the drug project is the Adver- 
se Reaetj()ns Titles, a monthly listing: of indexed classified 
titles for all articles reporting a side-effect of drufj- the- 
rapy. ^ 

• A review of the databank status in June, 1970 showed 
that 213,369 citations for tlie 1969 series had been pro- 
cessed thus far; 91.7% of these had been classified and 
7/.iD% indexed. For the 165,277 classified and indexed 
items, the average nundjor of classifications per citation 
was 2,18 and the average number of primary indexing 
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lerms per citation was 4.9(): 135,183 of these itcMiis wero 
ybstracted. With particular reo-ard to the 32.77G tlru^" 
items in the series, the average number of specifically 
Drugdoc classifications Jind Drugdoc p'rimary indexiiifj^ 
terms per citation is 1.94 aiid 6.05, respectively. Half of 
all items were classified and indexed within about 12 
weeks after input of the citation, while 507o of ^'le abst- 
racting effort was completed in about 22 weeks, (see fi- 
gure 12, Excerpta McMlica's information dissemination 
scheme). 
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W. Piroo- 

(PohiiHl) 

THE USE or COMPUTERS IN SCIENTIFIC, 
TECHNICAL AND ECONOAliC INFORMATION 

IN POLAND 

The problem of using coinpiilers in infomiation pro- 
cessing luivc, for many years, been of vital interest to llic 
Polish information serviee. It is the concern of The Centra! 
Institute for Scientific, Econ(jniic and Technical Infoi-nia- 
tion, a number of branch and plani information cenlres 
as well as of some universities antl research eslablishe- 
ments. Both theoretical as well as experimental and im- 
plementation works are now in progress. Special atten- 
tion has been given to this problem by the Branch Centre 
of Technical and Economic Information at the Inslitule 
of Ferrous Metallurgy in Gliwice, Branch Information 
Centre at the Central Construction and Research Bureau 
of the Shipbuilding Industry in Gdansk, Cliair of Econo- 
mics, Organisation and Planning at the Technical Univer- 
sity in Wroclaw, Information Centre of Electronics and 
Teletechnics UNITECH in Warsaw, Institute of Mathe- 
matical Machines in Warsaw, Institute of Building Tech- 
nology in Warsaw, Ministerial Centre of TechnicaJ and- 
Economic Information in the Ministry of Heavy Industry: 
and National Library. 

The forerunners of the presently developing initiative 
in the use of computers for information processing were; 
the works accomplished as early as in the first half of 
this decade by the Branch Centi-e of Technical and Eco- 
nomic Information at the Institute of Ferrous Metallurgy: 
in Gliwice, Branch Information Centre at the Institute of; 
Zootechnics in Krakow as well as Chair of Economics,! 
Organisation and Pjanning in Technical University in^' 
Wroclaw. " \ 
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The first two centres worked out and iniplemenlod tlio 
system-^ with perforated cards computers used, but the 
Chair used for its worlds the computer «Oi:>RA 100;3» con- 
structed in Pohmd. 

The research work and the tests tliat have, lor seve- 
ral years, been carried on by tlie Information Centre at 
the Institute of Ferrous A\etal]iu-gy in Giiwice, have resul- 
ted in a peculiar svsteiu of infornuition retrieval l)y means 
of perforated card machines of the Hollerith type insta.lled 
in t!ie specialized l-ntreprise of Analytical and Perforated 
Card AAachines for Metallurjry. 

Machine cards have been used to oro-anize the hoUhnns 
of all useful biblioKraphieat units inserted m <'=L^^P';^^ 
Information of Ferrous Melalluro-y and Stecl» pubhshed 
bv the above-mentioned Centre.- A i>-lossary ot most Ire- 
quently occurint^ and useful terms {2 500 words) and n 
code interpreter iiave been prepared for the description 
of documents. The cards are chosen from the collection 
of headin^-s by means of a selector. For these headin^is 
tabula<Trams arc printed with the help of a tabulator tliat 
contains the aala enablino- to indentify the docun^.enl. ^ 

To mechanize partly the information processes m _tnc 
Institute of Zootechnics a set of perforated card machines 
of the APITMA tvpe and teleprinter have been used. I he 
characteristic feature of the system used was also the 
preparation of a i.dossarv of kev words containm-i", m an 
eneodin*^- form, a set of chosen concepts used to describe 
the document. As a milter of fact, the i>-lossary appeal's 
to be a classification system. The efforts that have been 
made to use the UDC symbols have not proved fruitiul 
because of tlie insufficient depth of the UDC classilication 
in the class 639 from the viewpoint of modern zootecli- 

nics. " . , ii T 1 

As far as the patent information is concerned the lecli- 
nical University in Wroclaw has carried out the e.xperi- 
mentai works for the Polish Bureau of Patents, Petroche- 
mical Works at Plock as well as for the Union of Man- 
Made Fibres. The machine ODRA 1003 which has been 
used for these experiments does not possess an external 
memory and therefore could neither be used to form lar^^e 
information collections nor for automatic processinj^ of 
them, but it could be used as a selector of recordings to 
be put down on the -paper tape for making up the simple 
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lists. The necessity to apply as a memory the perforated 
tape and orfranization of the tape files prevent from usinj^ 
this method for practical purposes. Nevertheless, the expe- 
rimental works are being' co'nlimied on information sear- 
diin<? from the collection of patent specifications provided 
with the American patent classification. 

Biblioi^-raphical descriptions of patent specifications 
are be'mgr carried on to a perforated tape. The program 
that is introduced into the machine contains, among other 
things, the table t>f conversion from the American clas- 
sification to the classification used in Poland.. The ma- 
chine reads the perforated tape, and descriptions which 
correspond to a given class used in Poland are writ- 
ten down on the printing device in the form of lists. 

The next centre of the initiative taken on the use 
of computers for information processing is the shipbuil- 
ding industry— The Central Construction and Research 
Bureau of the Shipbuilding fndustry in Gdansk which 
cooperates with the Branch Information Centre. For these 
purposes the computer ELLIOT 803, equipped with the 
tape memory, was used, and the system was given the 
name APIS (Automatyczne Poszukiwanie Informacji Syn- 
tetycznej — Automatic Search for Synietic Information). 
The system was elaborated in 1967 and put into operation 
in 1968, and is designed: 

1. to make up and keep up-to-date the collection of 
bibliographical information 

2. to prepare the topical bibliographical lists on the re- 
quests of users. 

The information is a bibliographical description mar- 
ked with the appriopriate heading of their own branch 
decimal classification. Bibliographical descriptions are put 
down on the card Format A6, and are carried on to a 
paper tape by means of the automatic writing device 
«OPTIMA 528». This tape constitutes the collection which 
brings up to date the contents of the basic collection 
and, besides' it contains the subject matter of the next 
issue of the Information Express— -BIS published by 
the Centre. The primary task of the APIS system is to 
make up topical bibliographical lists, on the basis of 
the existing holdings to meet the user's requests. The 
user's enquiry is being transformed by a specialist from 
the section concerned with cooperation with a computa- 
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lion centre, in accordance witli the rules and terniniology 
for the searchmir prooTam, At tiic same time twenty inqui- 
ries can be introduced into the machine. The program 
enables to restrict . the topical bibliographical list to a 
defitiitc period and definite journals and enables to ob- 
serve these two limitations. It also mahes possil)le to 
print the topical biblio^n-aphical lists according to the 
heading words of branch classification, and after the 
names of the ships. ■ 

The inforniation processing in the APIS syst^:m is 
performed by means of the five programmes: updatmg 
(making the main coliection more complete), the control 
printout of a complcm^ itary collection, making up of the 
topical bibliographical lists on the ground of searches 
done in the main collection, modification ol the main 
collection and subsidiary collections, bringing up-to-date 
the main collection by removal of the outdated items from 
it and from the subsidiary collections and the printout 
of these items. 

The subject of the continued research done by^ the 
Central Construction and Research Bureau of the Ship- 
building Industry in Gdansk is to organize, by exploiting 
computer for information retrieval relevant to the facto- 
graphic data relating, to technical parameters of ships, 
data needed for designing calculations, building techno- 
logy as well as the inforiiiation for menagerial staff. 
The research and experimental works are now in pro- 

.Much attention is given to the automation of nitor- 
mation processes by the institute of Mathematical Ma- 
chines in which the Branch Information Centre develops 
its activities. In 1967 the Institute undertook research and 
preparatory work with the aim to mechanize the publi- 
shing works of the «Information Bulletin» of the Institute 
of Mathematical Machines— Bibiliography, elaboration 
of the subject index, and information retrieval. The subject 
of the publishing work on the «Bulletin» are bibliogra- 
phical descriptions relating to computers, and for marking 
them with their own classification has been adopted. The 
system has been called INBI (Informacja bibliograficz- 
na — Bibliographical Information). The first version oi 
the system has been accomplished by means of the Polish 
mal<e machine ZAM-2I which look the n^me TNB-I-A.- Be- 
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cause of unlypica! classification used and its unique naiu> 
re, the system could not bo widely miide known. 

Ti]e next version of Uie l.\'B I'system is the I.\'BI-B 
\vorl<ed ouf for the Polish iimchinc ZAM-41 which is 
equipped with the units of niag-netic tapes. The l)iblio- 
grtiphical descriptions for this system were enriched 
with" the addiiionfil details rehytino- to the date o\ pub- 
lication, lanouasye symbol and the symbol of the kind 
of a document. The INBI-B syj^tem is of the publishint^ 
nature. The system of searchiui^; prtH^Taniu^es has been 
elaborated for the retrieval of bibliof^'rapliical informa- 
tion which was pjiven the name I.\BIS (Search for Bib- 
liographical Information). The system also functions by 
the use of the machine ZAM-41. The selection of the 
information from. the collection and its preparation in 
the form of topical bibliographical lists can be made 
to answer enquiries which correspond to tlie classi- 
fication part (symbols' part) of the description or to 
their textual parts. As regards tlie last case, descriptions 
are chosen according to the correlation of word sequences 
contained in i:m inquiry with those in the collection. A se- 
arch request formulation is introduced into the machine 
witb the help of a perforated paper tape. The time of 
search done in collections depends on the degree of com- 
plexity of requests. The time of searching of 1000 items 
in the collection for siniple I'cquests with the classifica- 
tion part amounts only to about 1 minute, but for compli- 
cated requests with a widely build textual part the sear- 
ching- lime of 1000 items in the collection is 7—8 times 
^ls the former. The IBIS system is, undoubtedly, very 
interesting, but it could not be wddely used, and therefore 
it ought to be considered experimental as far as the re- 
cording and alphanumeric search are concenied. 

The problem of mechanization and automation of in- 
formation processes were the subject of interest to the 
Central Institute of Scientific, Technical and Economic 
Information as early as the years 1964—65—66. The job 
that was done was of introductory nature and it was 
limited to the research on literature and elaboration of 
the idea aimed at organizing and carrying on the experi- 
ments. Some works being the part of 'the research prob- 
lem were, among other things, carried out within the 
compound research plan which was worked out by the 



StandiiK^ Working Group for Scientific and Teciniical 
fnfclrmatlon aclin,^ withi.l Ihc Council of Mutual hcono- 
niic Cooperalion (SEW). They may l)e repi'dec! as du - 
rdical, concentrated - on the ^^^^'^^^'^Zn ll^l 

formation retrieval languages, researcli on diffcicnt Km Is 
of indexes etc. Tliese works arc contmued. Special aucn- 
tion is'bcino- paid within this o-roup of V^^^'^ 
cr'nlors and thesauri as well as to the ^^l^C pi cii is. 
In I%7 it was resolved by ClINTE to worK out the 
experimental system for one of the central i,,fc)r!nat.on 
lio dino-s run l y the Institute. For these purposes tit 
ent I'l-eq-ister of research pape.-s already comp e e, by 
the Instlkae was chosem On the basis of this P^"^^ '^" 
«Gnide of Scientific and I^esearch Papers, was ccn i mo 
out quarterly. Decision was taken to prepare this lUe- 
reiicc publication by means of a computer as the h s 
staQ-e o{ automation. The cooperation was established wi 
ZOWAR (Zaklad Obliczeniowy ~ Computation Ccntie) m 
Warsaw whi:h disposes of a computation centre and spe- 
cialists-programmers. Then it came 1.0 set ^M^,/; j;:;^^>P 
people that includes the specialists from CilNlL a 
ZOWAR with the aim to work out the project of the 
system. The order for elaboration of appropriate program- 
mes lias been placed with ZOWAR. The most d.aract i- 
stic feature of the system is the use of UDC lor me 
o-rounino- of information in the above-mentioned pubhca- 
tlon The system is called IGA (Informacja Grupowana 
Automatycznie- Information Grouped Automatical y 
and programmed to be realized by the computer lliA\-l4-iu 
(machine with disc memory). About 50 programs were 
elaborated for the system providing it with j^^J?^,^^^^^^^^^ 
sino- program, thus enabling the automatic conti ol between 
tbe'programnics to operate. The publication contains a col- 
lection of data arranged according to the UDC symbols, 
the subiect and ihe-author index and also a_lis of co - 
tents The IGA system came into operation m l^Db, un- 
dergoing different mutations. At present «the Guide» co- 
mes out every month and in includes the bibhographica 
descriptions of ' the completed researcn papers and he 
above-mentioned indexes. The work has now been started 
^ on the reprogramming of the IGA system for a tape 
computer as well as on the programmes designed for sedi- 
ching the required information in a collection. 
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Another initiative taken by CIINTE was Ihe elalyora- 
tion of a project of the system to make u[i. witli tiie help 
of the machine ZAM-41, ttie KWOC index intended lor 
the coiiectitjn of documentation cards relatino- t(j informa- 
tion problems run by CIINTE. Basing- on tiiis colleclion 
CIINTE has, ftjr several years, been puI)iishino- «Przcj^lad 

Pismicnnictwa Zat^^adnien Inf()rmacji» (Revicvv' of L.iiera- 
lurc on Information Problems). In order to project a sys- 
tem a o-roLip of specialists from CIINTE and from the 
Institute of Matliematicai Machines was organized by 
the end of 1967. The project was completed, experiments 

were conducted, and the system was put into operation 

in the third quarter of 1969, and as early a-s 1967 a 
i-eference book consisting- of two parts (main bibliogra- 
phical list and KWOC index) was pid:)lishecl. Each item of 
the bibliographical list is referred to the corresponding item 
of ihe PFZI issue. When accomplishing this project the ma- 
in coilcction of key words was build up. The system can 
be utilized to make up the KWOC index for other col- 
lections of documentary information run by CIINTE 

In 1967 CIINTE undertook ihe initiative in the deve- 
lopment of cooperation with branch information centres on 
the preparation of systems, making the experiments con- 
cerning the use of computers for publishing processes 
and mformation retrieval. Furthermore, CIINTE put for- 
ward the idea of organizing, at separate computer cent- 
res the teams of the branch centres concerned called «the 
clubs of computer users». In 1968 cooperation was under- 
taken With the Information Centre of Electronics and 
lelelexhnics UNITECH which took the initiative as early 
as 1967 to utilize the computer for the works connected 
wi h the publishing of «Current Bibliography» comin^r 
out once a week, and for the search of information in 
the coliection. For the description of documents UNITECH 
prepared their own glossary of descriptors which is cur- 
rently brought up to date. As a result of cooperation bet- 
ween the Information Centre UNITECH CIINTE and 
the Bureau of Research and Projects for 'the System of 
Electronic Data Processing, the printing and retrieval 
.system was accomplished for tiie machine ICT-900 called 
by Its cryptonim ASIA (Automatyczna Seiekcja Informacji 
Adresowane -Automatic Selection, of the Addressed In- 
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formation). The bibliog-raphical description made on a 
special form is, by means of the automatically writinf^ 
device «Optima>> 528, carried on to the eight-channcied 
tape perforated accordinfv to the ABC code. This tape ope- 
rates as a carrier of data tf) be introduced into a com- 
puter. At the output of tiie computer, fo ■ pre|varation of 
the column «Current Information*, a pei forated tape ac- 
cording to the «OPTIMA» code is bo'in^^ obtained. The re- 
cording- of the text i.s performed by the automatic wntini^^ 
device^<OPTIMA» 528. The ASIA system, in its prnituio- 
part, has already been put into operation. The research on 
the elaboration and implementation of the prog-rammcs 
for retrieving the information requested have also been 

completed. , 
In addition to UNITECH, other introductory research 
work connected with the use of a computer in information 
processes is also, for several years, conducted in the Mi- 
nistry of Engineering Industry and Ministry of Heavy 
Industry to meet their own needs. The main stress was 
laid on the elaboration of foundations suited to a new 
form, and contents of bibliographical description by a 
wide application of descriptors. The problem of the appli- 
cation of a computer for information processing was the 
subject matter of training courses in tlie form of confe- 
rence. A number of branch centres are working on this 
problem. These are: the above-mentioned Branch Centre 
at the Institute of Ferrous Metallurgy in Gliwice, Brancli 
Centre at the Industrial Institute of Agricultural Machines 
in Poznan, etc. In 1967, to. develop the research work the 
Centre of technical and Economic Information Processing 
was established under the Ministry of Heavy Industry, In 
1969 the Centre worked out for the machine ICT 900 a 
project of the publishing and retrieval system ASIW to 
handle the information relating to business trips made 
on the initiative of the Ministry of Engineering ami Mi- 
nistry of Heavy Industry. This system has already been 
put into operation. The above-said centre is also preparing 
the foundations for working out of- the SAGO-CZAS sys- 
tem for the machine ICT 1900 in order to print and 
retrieve the information about articles in scientific and 
technical journals. In both systems, subject headings, in 
addition to the bibliographical description, were used to 
describe the content of documents. 
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Ill the yenrs 19GC--1970, CIIN'TE, in cooperaiioii 
Willi tiic Iiislitule of Buildiiif)- 'reclmology worked out 
iiie project of the aLilonifitic retrieval of i^atent infor- 
mation (SA\Vi-2P) for the machine Minsk-22. For ox- 
])erimental pin-puses a collection of patents within tlio 
class 37 (luiihiino- structin-es) aiid 80 (buildino- mate- 
rials) was prciKn-ed. The aim of tlic s>\stein is to solve 
the i^i-oblem of I'ctriex'ing from the information collec- 
tion contained in the memory of a com|:niier the neces- 
sary patent specifications wilii the help of its various 
data (different classifications, formal and legal ele- 
ments and descriptors). 

An introductory work on the elaboration and pub- 
lication of catalog'nes and current bibliography has been 
done -in National Library which cooj:terates with the in- 
stitute of Mathematical Machines and utili/.es ClINTIi's 
experience. 

The. initiative taken in the field of research, desiL^n 
and experimentation to use a comjuiter for information 
proccs-ing* has, for tlie last two years, extended and 
iucludttid new centres and groups of specialists. The pre- 
sent condition may be charactieriz^-d as "'ollows: 

1. So Ir'ir, tile job iaken and wori<s or. the use of com- 
puter- for informatio]! processes in Polaufl ought to 
be considered as introductory to make the groups of 
•sped,.;i:ists familiar \\'itli the proI)leni and to train the 

staff 

2. The cxxperiments conducted will enable to collect ex- 
]:)eriv .ices' whicli will help to work out the projects 
fittcti for a wide ap|Dlication of computers in the na- 
tional system of information, bearing in mind the na- 
tionwide automated system of information 

3. The present research and experimental works have 
been carried on in a dissipated way, in different c. i- 
ditions, with different equipment used, and tlevoicl of 
a common aim and uniform foundations. Diffei'ent car- 
riers, of information, different classifications and in- 
dexing languages ha"/c been used. They may then be 
regarded as introductory and experitnenlal as compa- 
red to the advanced stage of the automated system 
of scientific infoi-mation in the world. 

The forecasts relating to the development of scien- 
tific and technical information for the next 5 years pro- 



vido tlK' cniicrcic nnd siicvossi vo inipleuK-ni nlioii _()f nuln- 
mnlL'd processes for (lie nalioiiwidc system ul scion- 
lific :nul technical ...iornuitic:)n. The fundanienlMl reciin- 
rernenls to l)e put on the system of nuloinaled mlonna- 
tiun is to ensure liie universality of mlormnliuu How 
U results from the designed and realized ors^a n izat i> vna I 
structure of the country information service adaplctti to 
the system of the phnuied economy . 

hi the present manual system informalum tne 

universih flows is assured. In the auioniaied sysien\ li 
makes n- cessary to standardize the documentary entry 
indexino- languaoc and parameters of the equiprnGnt anc 
outfit The starting' point is to coordiiiate research and 
experimental works. These w<H-ks, in compliance witlv ihe 
plans worked out by CIlNTi- and the Plenipotentiary 
of the G'..wernment for Electronic Data Proccssmo-, ^vUi 
l)e develoiioed in the field of aintomated printino- and sear- 
chino- systems for various sciences, in c{)operalion ^vlth 
the Tnfo'rmation centres concerned. As the present fore- 
casts indicate, in the years 1971 — 1973 a project of na- 
tional-wide system of automa:led information wdll be wor- 
ked out, and in the years PJ74— 1975 and in the years 
lo follow gradual implementation for practical purposes 
will be performed. The works that deserve the j^'reatest 
attention are those relating in n'me elaboration of m sub- 
system project for the automateti palent information being 
nin by the Bureau of Patents and CUNTE, the works 
on the automation of printing- processes of central ca- 
talogues, catalogue publications, current bibliooraphy to 
bo done' by The National Library, as well as the works 
on the automation of information in the' field of speciali- 
zed holdings done jointly with the Centre of Economic 
anti Technical Information Processing of the Ministry of 
Heavy Industry and Ministry of Engineering Industry. 
At the same time, alongside of the experimental works 
it is scheduled to conduct the experimental works. 

The integral part of the system designed will be also 
the subsystems of specialized central holdings now organi- 
zed by CIINTE. They consist of information holdings con- 
cerning the completed research papers, translations of 
foreign scientific and technical publications made into 
Polish, topical surveys already prepared, publications 
on information problems, documentation and library sci- 
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enco. The important CMiicrpr. -.{no; will be tke siihsyslem 
of current awarcnuss iniorni;: ;!it)n roicitiny- to liic nrtu-k?s 
in scientific and teciniicaj jo urnals. The important eie- 
nient o.i the systein is tfie i-cEuLral computer centre to he 
set up at the end of the ne^r 5 years in order to make 
possible sorvicino- the central holdino-s of information, 
centralisation of information publications of tlie current 
.'uvaieness type. 

Tlie elaboration and inijilenientation of the project for 
;uito}natcd informaiion on the nation wide scale is not 
limited to the initiative taken in the preparation and rea- 
lization of fractional projects relating- to individual fields. 
However, it is assumed as a: ecjndition to harmonize the 
efforts with the coordina iin.Lj institution — CIINTE. The 
essential condition for the efoioration of the nationl-wtde 
project is the development c*.ri- a lar^-e scale of research 
and cxperimentai works. Tiheirre are difficult proljlerns 
■among them to be solved: cil,; sification systems, indexing 
and information retrieval raguage. Efforts are to be 
made to work out a univer^^^n;-: information retrieval lan- 
guage. 

The project in progress wiill be harmonized with the 
project of inte:mational system .for scientific and technical 
mformation in the countries <s:Touped in the Council of 
Mutual Cooperation which is being worked out by the In- 
ternational Centre of Informaiion in Moscow. In this case 
the coordination is indispen;s;;ible to ensure appriopriate 
cooperation. 

The project should also lake into account the world 
ti-ends in mechanization and automation of information 
processes. It is necessary to present some of them as the 
most important. 

— setting up of large branch automated centres of in- 
formation (printing and retrieval systems) capable of 
providing the users with the encoded information ta- 
pes 

— automatic processing of documents (abstracting) 

— input of many texts into the memory of a computer 
by rapidly working readers of texts 

— storage of a great number of informations in the ex- 
ternal memories of computers 

— utilization of a computer through the transmission data 
network 



— ohihornlion nnd Mp|Tlicnlio ol tiio devices for siorngo 
find prinlin<j; of s>"riiphii.-al information, 

— reeeipl of infonualion from a coniijHiler iiiunediaU'ly on 
to nncrofichcs and inirofilnis 

— searcliint^- in micraforni collections by computer 

— workin^^-" out of a modern retrieval lauf^uaye with uni- 
versal nature 

In the light of the debate held at ne Second Internatio- 
nal Semimar on the UDC and Mechanized Information 
Systems (Frankfurt on Main [—5 June, 1970) the UDC 
could be such a lant^uage after api nate modification. 

Tlie introduction of computers t( . the information sys- 
tem requires of a qualified staff itu be able to carry on 
research and to utilize the automated subsystems whicli 
are put into operation. 
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